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APPENDIX  A 


A1 


Minimal  Root  Sensitivity  in  Linear  Systems 

(IFAC  Workshop  for  Applications  of  Nonlinear  Programming  and  Optimal 
Control,  June  20-22,  1983,  San  Francisco.) 

Robert  E.  Skelton  and  David  A.  Wagie 
Purdue  University 

School  of  Aeronautics  &  Astronautics 
West  Lafayette,  Indiana  47907 


Abstract 

A  lower  bound  is  derived  for  root  sensitivity  and  an  explicit  cri¬ 
teria  for  achieving  this  minimum  Is  given.  Secondly,  an  optimal  output 
feedback  control  problem  is  discussed  which  penalizes  an  index  related 
to  root  sensitivity. 


i 

i 


1.0  Introduction 


A2 


The  modal  data  for  physical  systems  is  rarely  well  known.  This  can 
make  stability  predictions  unreliable  in  feedback  control  problems  and 
can  make  the  behavior  far  from  the  analytical  predictions.  This  paper 
documents  the  smallest  possible  sensitivity  of  eigenvalues  x^  with  respect 
to  the  independent  plant  parameters  in  linear  systems  of  the  form 


x  *  Ax  ,  XEK 


(1) 


ax. 


That  is,  the  norm  of  and  the  lower  bound  of  its  norm  are  of 
interest.  Secondly,  a  metric  related  to  root  sensitivity  is  added  to  the  optimal 
output  feedback  problem  to  achieve  a  compromise  between  performance  and 
root  sensitivity. 

The  norm  of  a  matrix  shall  be  denoted  by 


ICOIIq- tr  .  tr  [-]  *  trace  [•] 

and  the  norm  of  a  vector  shall  be  denoted  by 


(2) 


IIHIIq  =  (•)  Q(-) 


(3) 


where  *  denotes  complex  conjugate  transpose.  Results  herein  are  limited 
to  the  case  of  distinct  eigenvalues  for  A. 


2.0  The  Construction  of  a  Root  Sensitivity  Metric 


>th 


The  sensitivity  of  the  i  eigenvalue  ax^/aA  is  a  nxn  matrix  denoted 
A 

by  S.j  *  aXj/aA.  The  norm  of  S^  from  (3)  is 


,2  a  n  n  ax4  ax,  * 

1*1 15 =  tr  Wi  -  I  I[^  Kjji  no,  (4) 

a“l  6=1  arow  MSrow  aB 


Brow 


aB 


1 


where  Aarow  *  (A^,  ....  Aq  ),  and  1  0-  The  weighting  matrix 

Q.  is  to  be  chosen  larger  than  Q  •  If  root  shifts  in  x,  are  of  greater 

concern  than  roots  shifts  in  X*.  Within  the  matrix  Q^,  the  weight 

th 

is  to  be  chosen  larger  than  Q.  if  the  parameters  of  the  a  row  of  A 

.  1 03 

(these  are  associated  with  xq  in  (1))  are  more  uncertain  than  those  of 

the  eth  row  of  A.  (Hence,  a  logical  choice  for  is  a  value  propor- 

aa 

tional  to  the  variance  of  parameters  Aarow).  Finally,  the  complete  root 
sensitivity  metric  of  interest  is 

e  =  V  IIC  II  C  &  V  II  _ 111  C  f  C\ 


J,  W, 


Thus,  from  the  point  of  view  of  robustness,  a  system  design  with  a  large 
value  of  s  might  be  considered  less  desirable  than  a  system  design  with  a 
small  value  of  s. 

3.0  Computation  of  the  Root  Sensitivity  Metric 

It  is  assumed  that  A  has  a  linearly  independent  set  of  eigenvectors 


Ae _*  =  e^  x,j  — ,  i  *  1,  ...,  n 
The  reciprocal  basis  vectors  ^  are  defined  by 


E  ■  Cei . ■ E_1  > hence  ei’ej  *  sij  <7) 


Multiplying  (6)  from  the  left  by  ,  using  (7),  yields  the  eigenvalues 
in  terms  of  A,  its  eigenvectors,  and  its  reciprocal  basis  vectors. 


A4 

Differentiation  of  the  scalar  (8)  with  respect  to  A  provides  the  required 
sensitivity  ax^/3A.  To  derive  this  result  two  identities  from  linear 
algebra  are  required 


tr  AB  =  tr  BA 

(9) 

(tr  AB)  =  (tr  BA)  =  B^ 

(10) 

where  (9),  (10)  hold  for  real  or  complex  matrices  8  and  A,  and  (10)  holds  if 
and  only  if  the  elements  of  A  are  independent.  Hence  from  (8),  using  (9),  (10), 

a)T  =  Iff  ttr  A*ei  *i  =  <ei  0T  *  ViT  Ol) 

where  ~  denotes  complex  conjugate.  The  weighted  norm  (4)  may  now  be  ~ 
wri tten 

IMS,  ‘  tr  SlVl  *  tr'ViT>*  Q,(*|  e,T)  •  l|t1llqjl|e1ll  2  (12) 

where  the  last  equality  requires  use  of  identity  (9)  again,  and  where 

IMq  *  Ai*QAi  »  ilM2  =  ei*e1  03) 

One  may  note  the  similarity  between  (11)  and  the  weak  differential  of  X i 
given  by  Eq.  (6.2.3)  of  [l  ].  Also  note  the  similarity  between  (12)  and 
the  upper  bound  of  the  weak  derivative  of  x ^  provided  on  the  top  of  p.  235 
in  [l  ].  This  paper  seeks  lower  bounds  rather  than  the  upper  bounds  of 
[l  j.  Otherwise  the  nature  of  the  results  are  similar  (see  theorem  6.2.4 
of  [1]). 

Note  also  that  without  loss  of  generality  eigenvectors  e^  may  always 
be  normalized  to  unit  length.  In  which  case  (12)  may  be  written 

II Mq,  ■  IMq  >  IM2'1  •  (M) 


KB* 


1$ 


>1 

v, 

¥ 


1 


‘**1 

% 

■s 

V 
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*!' 
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Define  by  use  of  the  Cholesky  [1]  decomposition  of  Q,  =  $!  (K » 

ri  2  Vi  •  «i  ■  4  a. 

Then  (12)  may  be  written 

ns,njj  *  ii*,Hq  ii*,h2  -  ii >-1  ii 2  im2 

The  Schwartz  inequality  [l  ]  holds  for  any  two  vectors  ,  e.. 

<111,11  lie,  II 

Since  the  particular  vectors  e^  are  related  by  (8), 


t,  e,  -  1 


(17)  and  (18)  lead  immediately  to 


lie,  II  lie, II  »  1 

From  (15)  note  also  that 


Hence , 


110,^11  =  ||ri|i 


110,11  II*,  II  1  llr,  || 


Multiply  (19)  by  ||2^||  and  use  (20)  to  obtain 


(20a) 


(20b) 


110,11  ||*, ||  lie,!!  >  Ur, ||  He, ||  >  ||a,||  (21) 

Squaring  both  sides  of  (21),  using  the  fact  ||2, ||2  =  tr^j^,  =  tr  =  ||Q^ || 


leads  to 


Ik, II2  II®, !r  i  Hq,  II 


The  equality  in  (19),  and  hence  in  (22),  holds  if  and  only  if  i.  and  e^  are 

col  inear  (i,.  =  e-  for  normalized  e^  ),  [l  ].  From  linear  algebra  [2],  £.=6^ 

if  and  only  if  A  is  normal  (AA*  =  A*A).  Thus,  these  results  are  summarized  as 
follows. 

Theorem  1 

Let  (X..,  e^ ,  *.)  be  the  i^  eigenvalue 3  eigenvector  and  ite  reciprocal 
basis  vector  associated  with  the  real  matrix  A.  If  A  has  a  linearly  inde¬ 
pendent  set  of  eigenvectors  e^,  1=1,  ....  n,  then 

3  X  2 

\\j^\  i  110,11  (23) 

where  the  lower  bound 

II^-Hq,  ■  HOfll 

is  achieved  (for  f  OJ  if  and  only  if  AA*  =  A*A. 

Corollary  to  Theorem  1: 

The  sensitivity  metric  (5)  is  bounded  from  below  by 

S  >  1  110,11  (24) 

i=l 

and ,  for  the  case  Q.»  f  0  (i=l  ,2, .. . ,n) ,  the  minimum  sensitivity 
n  1 

$  "  ^  ,|Q1M  is  achieved  if  and  only  if  A  is  normal  (AA*  =  A*A). 

The  corollary  provides  necessary  and  sufficient  conditions  for  minimum 
root  sensitivity.  The  next  section  suggests  a  means  to  incorporate  this 
information  into  the  output  feedback  control  design  problem.. 


A7 


4.0  Output  Feedback  Design 

It  has  been  shown  [  3]-[  5]  that  the  necessary  conditions  for  the 
output  feedback  for  the  system 

x  =  Ax  +  Bu  +  Dw  E(y)  =  0, 

y  =  Cx  ^<v(t)>  (wT(t)>  vT(t))  ’  [“  v]  5  (t"t) 

z  =  Mx  +  v 

u  =  Gz  xT(o)  -  0,  t  >  0  (25) 


to  minimize 

V‘  11m  E(||y  II  g  +  ||u  II  |) 
t->°°  V 

=  tr  P(CTQC  +  MTGTRGM)  +  tr  VGTRG  (26  b) 

are 

0  *  K(A+BGM)  +  (A+BGM)TK  +  MTGTRGM  +  CTQC  (27) 

0  =  P(A+BGH)T  +  (A+BGM)P  +  BGVGTBT  +  DWDT  (28) 

0  =  RGMPM1  +  RGV  +  BTKPMT  +  BTKBGV  +  V  (29) 


where  ^  =  0  for  the  standard  measurement  feedback  problem  in  [3  ]-[  5  ]. 
Various  suboptimal  strategies  for  approximating  the  solution  of  (27)-(29) 
may  be  found  in  the  literature  [4  ]-[5  ]. 

Parameter  sensitivity  has  long  been  a  concern  in  optimal  control. 
Some  authors  [6  ]-[7  ]  have  suggested  modifying  the  performance  index 
(26a)  by  the  addition  of  trajectory  sensitivity  terms  z|l|pH|2  (where 
p^,  1  =1,  ...  ,r  represent  the  uncertain  parameters).  The  resulting 
computational  burdens  are  very  great  indeed,  since  the  dimension  of  the 


constraint  (state)  equations  becomes  n(l+r).  There  is  also  the  fact 
that  minimizing  output  sensitivity  does  not  necessarily  keep  root 
sensitivity  small.  Hence,  the  concerns  of  stability  in  the  presence 
of  parameter  uncertainty  are  not  addressed  by  trajectory  sensitivity  - 
methods.  In  Section  3.0  we  showed  that  root  sensitivity  is  minimized 
when  A  is  normal.  Motivated  by  Section  3.0,  we  note  that  some  of  the 
shortcomings  of  the  trajectory  sensitivity  methods  are  therefore  avoided 
by  minimizing  an  "abnormality"  index  related  to  root  sensitivity.  It  has  also 
been  shown  [8]  that  the  robustness  bound  for  a  certain  class  of  parameter 
errors  is  maximized  when  the  plant  matrix.  A,  is  normal. 

Motivated,  therefore,  by  Theorem  1  and  [8]  we  pose  a  new  performance 
index  for  optimization  that  includes  an  "abnormality"  penalty  ^ 

r  =  lim  E(||y||n  +  ||u|||)  +  6  ||(A+BGM)(A+BGM)T-(A+BGM)T(A+GBM)||2,  (30) 

t-»«  ^  K  C  QC 

When  8  is  much  smaller  than  the  norms  of  Q  and  R  the  solution  tends  toward 
the  standard  optimal  control  result  (27)- (29)  with  ^  =  0.  On  the  other  extreme, 
when  6  is  chosen  much  larger  than  the  norms  of  Q  and  R,  the  closed- 

loop  system  approaches  the  smallest  possible  root  sensitivity.  (From 
Theorem  1  and  its  corollary  note  that  root  sensitivity  is  minimized  if  and 
only  if  the  latter  term  in  (30)  is  zero.  Other  choices 

of  weights  on  the  matrix  norm  may  be  chosen  besides  CTQC.  This  choice 
is  suggested  only  to  make  the  sensitivity  weight  CTQC  the  same  as  the 
state  weight  in  y^Qy  =  x^[C^QC]x. 

Using  the  same  matrix  norm  as  in  previous  sections,  and  defining 
A  =  A+BGM,  V'  becomes 


n  3 


V  =  tr  F(CTQC  +  MTGTRGM)  +  tr  VGTRG  + 

3  tr[(AAT  -  ATA)]CTQC[(AAT  -  ATA)  ]  (31) 

The  necessary  conditions  for  the  optimum  G  are  obtained  by  augmenting  the 
constraint  equation  (28)  [which  defines  the  P  in  (26)  and  (29)]  to  (31  ) 
via  Lagrange  multiplier  matrix  K  and  differentiating  the  augmented  l/'  with 
respect  to  P,  K,  and  G.  The  equations  (27)  -  (29)  result,  with  the  fol¬ 
lowing  definition  of  \j>: 

*  &  6BT{[(AAT  -  ata)ctqc  +  ctqc(aat  -  ATA)]A 

-  A[(AAT  -  ATA)CTQC  +  CTQC(AAT  -  ATA)]}M‘  (32) 

These  results  are  summarized  as  follows: 

Theorem  2 

The  necessary  conditions  for  minimizing  (30)  subject  to  the  constraints 
(25)  are  given  by  (27)  -  (29)  and  (32). 

The  following  conclusion  should  also  be  clear,  since  for  an  arbitrary 
A  the  matrix  A+BGM  can  be  made  normal (by  choice  of  G)only  if  rank  B  = 
rank  M  =  n. 

Theorem  3 

Suppose  Q,|  =  I,  i  =  1,  ...»  n  in  (5).  The  minimum  sensitivity  s  =  n 
can  be  guaranteed  by  output  feedback  control  for  arbitrary  A  if  and 
only  if  rank  B  =  rank  M  =  n.  Furthermore ,  the  control  gain  in  this  case 
is  not  unique .  Two  gains  that  provide  minimum  sensitivity  are 


Proof: 

Substitute  (33),  (34)  into  the  normality  condition  for  minimum  sensitivity 

(A  +  BGM)(A  +  BGM)T  -  (A  +  BGM)T(A  +  BGM)  =  0  (35) 

to  see  that  the  condition  (35)  holds. 

5.0  Application  of  Closed-Loop  Root  Sensitivity  Design 
Example  1 : 

The  pitch  motion  of  a  rigid  aircraft  is  governed  by,  [9], 


a  h  i  ("y  r '  v 

W  K2  °J\v  l»o\ 


where  a  is  the  angle  of  attack,  q  is  the  pitch  rate,  u  is  the  elevator 

angle,  t  is  the  lifting  time  constant,  u>0  is  the  undamped  pitch  natural 

frequency,  and  Q  is  the  elevator  effectiveness.  In  the  open-loop  case, 

(u+w)=  0,  we  are  interested  in  the  root  sensitivity  properties  of  (36). 

Note  from  the  corollary  to  Theorem  1  that  minimum  sensitivity 
2  3Xa  2 

s  *  r  II  II  *  ^  is  achieved  if  and  only  if  A  is  normal.  Computing  the 
"abnormality"  matrix  (AA*  -  A*A) yields 

. I  ^1+“bz>l 


AA*  -  A*A  * 


.^1+“02)  “o4-1 


Thus,  root  sensitivity  takes  on  Its  absolute  minimum  when  fo0*l#  -  =  0. 
This,  of  course.  Is  not  a  practical  possibility  for  the  aircraft.  Now  con¬ 
sider  the  output  feedback  design  of  Section  4. 


VS.'- 


Example  2:  Let  the  angle  of  attack  measurement  be  made  z  =  a+v  where  Ev  =  0, 


EMt)v(x)]-  6 ( t ”T )  describes  the  white  measurement  noise  v  and  Ew  c  0 
E[w(t)w(x)] =6(t-x)  describes  the  white  actuator  noise  w.  For  the  system  (36) 
design  a  measurement  feedback  control  law  for  regulating  u  such  that 

1/  =  lim  {E(q2+pu2)  +$  p+BGM)  (A+BGM)1- (A+BGM)T(A+B6M)  ||2T  }  (38) 
t-*®  C  C 

I 

is  minimized.  Assume  Q  *  1,  Q  *  I,  and  C  =  [0,  1]. 

The  solution  is  provided  by  (29),  where  P  is  obtained  from  (28), 


P  = 


1 


1  ^  +  uq2t(1-G) 


a jJ  (G2+l) 


“0 

ITug) 


(39) 


and  K  is  obtained  from  (27)  (For  this  example  R  =  p) 


K  = 


^CG2pt-u02t(G-1)] 


■1/2 


-1/2 

pG2x2+l  -  U)02T2(G-1) 
2tu,02(1-G) 


W 


For  the  aircraft  example,  (29)  yields  a  fifth  order  equation  in  G  as  a 

2  2 
function  of  S,  p,  wQ  and  T*  For  given  values  of  p,  S,  u  and  T  this 

equation  will  yield  five  candidate  values  of  G.  The  optimal  G  is  that 

root  that  minimizes  the  cost  function.  In  the  present  casei  G  <  1  Is 

required  for  stability.  Tf  none  of  the  real  G  are  <  1,  then  analysis 

of  this  problem  shows  that  a  decrease  in  B  will  drive  G  toward  more 


stable  values.  After  changing  6,  the  equation  can  then  be  solved 
once  again  for  G.  This  iteration  is  repeated  until  a  stable  (G  <  1) 
solution  is  achieved.  (One  may  also  increase  p  to  seek  stability, 
although  the  direction  of  change  of  p  required  depends  on  e). 

Assume  a>0  =1 ,  t=.7  (this  corresponds  to  damping  ratio  =  .71).  Set 
ting  8=0,  p=l  yields  the  standard  optimal  measurement  feedback  control 

G  =  -0.118  (41) 

and  setting  e =  p  <  »  yields  the  optimally  sensitive  design 

G  =  2.0  .  (42) 


This  choice  of  G  in  (63)  forces  the  closed-loop  system  matrix  to 
be  synmetric 


A  +  BGM 


-1/x  1“ 

“l/T 

l" 

-“o2+“o2g  °- 

.1 

0_ 

r 


(43) 


and  by  Theorem  1  and  its  corollary,  the  sensitivity  is  at  its  minimum  in 
this  case.  Note,  however,  that  stability  is  lost  by  this  minimum  sensitivity 
design,  (G  <  1  is  required  for  stability).  Thus,  minimally  sensitive 
designs  might  not  be  stable. 

Figures  1  and  2  show  the  tradeoff  between  output  performance 

V  =  11m  E||y||?  versus  the  control  effort  y  =  lim  E[| u|| ^  (Fig.l) 
y  t—  g  *  *  t-*~  K 

and  abnormality  of  A  =  (A+BGM),  -  j|  AA  -A  A  HcTqc»  versus  the  control 

2 

effort  (Fig. 2).  For  both  figures  u  =  1,  t*.7,  8  and  p  vary.  In  the  stan¬ 
dard  output  feedback  design  (bx0),  the  output  performance  is  improved  with 
an  increase  in  control  effort  (Fig.  1)  whereas  the  abnormality  index  greatly 
increases  with  control  effort  (Fig.  2  with  8  =  0).  Note  also  that  8  >  2  is  not 


desired,  since  larger  values  of  6  do  not  yield  substantially  larger 
abnormality  reductions  (Fig.  2)  but  do  accelerate  the  degradation  of 
the  nominal  output  performance  (Fig.  1). 


Increasing  p 


»„  4  i~  E  I'  “  Ur 


Figure  1.  Output  Cost  (Vy)  vs  Input  Cost  ((/„)  [t=.7 


«v,w«wt'.v.wsvnp,v 


5.0  Conclusions 

Explicit  expressions  for  a  scalar  metric  of  root  sensitivity  is 
given  in  terms  of  the  left  and  right  eigenvectors  of  the  system,  so  that 
sensitivity  of  each  eigenvalue  with  respect  to  the  plant  matrix  may  be 
readily  computed.  A  necessary  and  sufficient  condition  for  minimum  root 
sensitivity  is  that  the  plant  matrix  of  the  state  equations  be  normal. 

However,  root  sensitivity  alone  is  not  a  sufficient  design  goal. 
Thus,  an  "abnormality"  term  is  added  to  the  traditional  quadratic  per¬ 
formance  metric  of  optimal  control.  The  necessary  conditions  are  given 
for  the  solution  of  this  problem  and  an  example  gives  some  practical 
insights. 

The  weaknesses  in  these  results  include  the  fact  that  the  necessary 
and  sufficient  condition  (normality)  for  minimal  root  sensitivity  are 
only  local  results  and  also  that  they  apply  only  for  non-defective  plant 
matrices  (A  has  linearly  independent  eigenvectors). 
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ABSTRACT 

The  weighting  matrices  in  the  standard  Linear  Quadratic  Gaussian  (LQG) 
theory  are  most  often  used  to  achieve  pole  assignment.  Instead,  this  paper 
proposes  to  select  these  weighting  matrices  to  achieve  RMS  bounds  on  inputs 
and  outputs. 

1.0  INTRODUCTION 

One  of  the  most  frequent  complaints  about  the  application  of  the 
standard  Linear  Quadratic  Gaussian  (LQG)  theory  is  the  "arbitrariness"  in 
the  choice  of  weights  Q  and  R  in  the  quadratic  cost  function 

V  -  E  lim  (  || y (t)  ||^  +  ||u(t)  ||jS  (1.1) 

t-n»  Q  R 

which  is  minimized  subject  to  the  constraints 

x  -  Ax  +  Bu  +  Dw  ,  xeRn  ,  ueR111  ,  weR^ 

y  *=  Cx  yeR  (system  outputs)  (1.2) 

l 

z  -  Mx  +  v  zeR  (system  measurements) 


Ew  *  0  ,  Ev  «  0  , 


T 

X  (t  ) 

[ xT(t  )  wT(t),  vT(t)  I  « 

X 

0 

0 

o 

L  °  -1 

o 

w(t) 

0 

W6(t-t) 

0 

v(t) 

0 

0 

V6(t-T) 

W  >  0  ,  V  >  0 

where  w(t)  and  v(t)  are  zero-mean  white  noise  processes  with  intensities 
W  and  V  respectively,  and  E  is  the  expectation  operator  and  T  denotes 
matrix  transposition.  In  the  literature,  there  have  been  two  basic  sug¬ 
gestions  for  the  choice  of  Q,  R. 

The  first  suggestion,  made  by  Bryson  [l],  relates  Q  and  R  to  the 

2  2 

input-output  specifications  ;  where  the  desired  values  of  the 

response  of  the  closed-loop  system  are 


or 


2  2 
y[U)  <  ol± 


2/  x  2 

u^t)  <_  v 


or 


yi 

‘7  ^ 


Ui  2 

(— )  <  1 


(1.3) 


leading  Co  Bryson's  suggestion  for  Che  choices  of  Q  and  R 


Q  - 

l/o2 

,  R  - 

• 

*  2 

L 

*  2 

mUvr 
_  m_ 

(1.4) 


Hence  with  this  choice  of  Q  and  R  (1.1)  becomes 

V  -  lim  {  l  Q,,(Ey2)  +  l  R..(Eu2)}  (1.5) 

t-*»  i-1  11  1  j  =  l  22  2 

It  usually  happens,  however,  that  even  though  this  choice  of  Q  and  R  has 
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some  physical  motivation  the  actual  u^(t)  do  not  satisfy  the 

desired,  bounds  (1-3).  Furthermore,  there  is  no  theory  available  which 
will  show  how  to  choose  Q  and  R  such  that  (1.3)  is  satisfied.  Neither  is 
there  a  method  available  which  will  guarantee  satisfaction  of  (1.3)  in  the 
mean  squared  sense 


Ey*(t>  <  a\ 


i-1,  ...»  k 
j—  1,  ...,m 


(1.6) 


The  second  basic  suggestion  about  the  choice  of  Q,  R  in  the  literature 
has  been  to  choose  Q  and  R  to  achieve  a  desired  pole  assignment  in  the 
closed-loop  system  [2]  ,[3].  In  [4]  a  gradient  technique  adjusts  the  ele¬ 
ments  of  Q,  R  to  maximize  the  singular  values  of  selected  return  difference 
matrices  which  are  related  to  stability  margins  and  disturbance  rejection. 
There  has  been  much  more  written  about  the  pole  assignment  role  for  Q  and 
R  than  about  the  time  response  assignment  (1.3)  or  the  variance  assignment 
(1.6).  This  is  indeed  curious  since  LQG  is  a  theory  steeped  in  the  time- 
domain  and  is  directly  concerned  with  the  time  responses  y(t)  and  u(t). 

One  might  even  argue  that  it  is  a  bit  obtuse  to  use  LQG  theory  to  do  pole 
assignment  since  there  are  more  direct  methods  to  assign  poles  which  do 
not  use  the  artifice  of  LQG  theory,  [5]. 

This  paper  deals  with  the  more  natural  use  of  LQG  theory  in  the  time- 
domain  question  relating  to  objective  (1.6).  The  ideal  goal  is  to  find  a 


X-. 


linear  dynamical  feedback  controller  which  satisfies  the  variance  con¬ 
straints  (1.6).  We  shall  alter  this  problem  statement  to  make  it  more 
tractable  by  using  the  steady  approximation  of  the  constraints  (1.6) 


lim  Ey^(t)  _<  o* 

t-HO 

lim  Eu^(t)  _< 
t-*»  1 


(1.7) 


Then,  a  penalty  function  approach  (i.e.  LQG  theory)  is  employed  to  accommo¬ 
date  the  constraints  (1.7),  in  the  manner  (1.5)  by  proper  choice  of  Q^, 
and  R_^.  This  formulation  automatically  satisfies  the  constraint  for  a 
linear  dynamical  feedback  controller,  since  the  optimal  LQG  controller  for 
(1.5)  employs  the  standard  Kalman  filter  and  state  feedback  control  gains. 
This  will  be  called  the 

J 

Constrain/ed  Variance  LQG  Problem  (LQG^)  : 

Find  Q  and  R^  in  (1.5)  eo  that  (1.7)  is  satisfied ,  subject  to 

(1.2). 


The  LQG  problem  does  not  have  a  solution  for  every  set  of  require- 
2  CV  2 

ments  o  ,  (i-1,  ....  k) ,  p, ,  (j-1,  . . . ,  m) .  To  see  that  this  is  true, 

1  J  k  _m 

consider  a  single  input-single  output  system.  Hence,  yeR  (k-1) ,  uek  (m«l) . 

The  LQG  theory  promises  an  inverse  relationship  between  output  regulation 

and  control  effort  as  the  weighting  R  on  control  effort  varies  from  0  to 

infinity,  as  depicted  in  Fig.  1. 


R 


liw  / u\t) 


Fig.  1  Results  of  LQG  theory  for  SISO  Systems 


.  **-  *'«■ 


Clearly  no  solution  to  the  LQGcv  problem  exists  for  the  problem  imposed 


by  the  values  (o  ,  y  )  but  an  infinite  number  of  solutions  exist  for  the 


values  (o^,  y^) . 


The  key  to  progress  beyond  the  Bryson  rule  (1.4)  is  to  provide  a 
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relationship  between  the  actual  variances  Ey^  ,  Eu^  and  the  choices  of 
Qii  and  in  (1.5).  This  step  is  provided  in  Section  2  and  is  based  upon 
the  cost  decomposition  results  of  [6].  An  iterative  algorithm  based  upon 
these  results  is  given  in  Section  3.  Section  4  applies  the  algorithm  to  a 
Large  Space  Structure. 


2.0  INPUT/OUTPUT  COST  ANALYSIS 
Under  the  assumptions  (2.1) 


(A,  B)  ,  (A,  D)  stabilizable  pairs 
(A,  C)  ,  (A,  M)  detectable  pairs 


(2.1) 


the  LQG  problem  (1.1)  (1.2)  has  the  solution  [  ] 
u  -  Gx  ,  G  -  -R_1BTKx 
KA  +  ATK  -  KBR_1BTK  +  CTQC  -  0 
x  -  (A+BG)x  +  F(z-Mx)  ,  F  -  PMTV-1 

pat+ap  -  pmtv-1mp  +  dwdt  -  0 

yielding  the  closed-loop  regulator 


(2.2a) 
(2.2b) 
(2.2c) 
(2. 2d) 


(2.3) 


System  (2.3)  is  a  linear  system  driven  by  white  noise  and  with  output 
y.  It  has  been  established  in  [6]  that  such  system  outputs  y satisfy  a 
"cost-decomposition"  property 

k+m  „  ? 

V  •  l  V*  -  11m  E\\y\\l  (2.4) 

i-1  ■L  t-«°  ^ 


where 


ua  i  91!^II 

'i  - 1 

t-H»  Ji 

The  calculation  and  study  of  the  "output  costs"  is  called  "Output  Cost 
Analysis"  in  [  6  ].  We  momentarily  postpone  the  calculation  of  V^. 

Now  the  choice  Q  -  I  in  (2.5)  leads  to 


vy  £  lta  i  E(i|idL 

t-*-»  -si 


\  E(-|M-  y  )  -  llm  E(y2)  i-1,  .... 

1  3^i  1  t-K»  1 


•5  »  |  e(|ML  »j) 

t-*-o°  “'j 


lim  E(u^)  ,  j=k+i 


i- 1 )  # . » ,  m 


and  for  Q.  =  I,  note  that  (2.4)  becomes  the  unweighted  cost  \J° 

r  =  lim  EC  ||y  II2  +  Hull2)  -  l  v{+  l  (2.8 

t-~>  i-1  1  j-i  J 

The  system  (2.3)  evaluated  according  to  (2.8)  yields  the  following  output 
cost  analysis,  using  the  output  cost  formula  given  by  Eq.  (4.14)  of  [6]. 

-  lim  E(y2)  -  Ikjlx  “  ciXci  (2,9 

t '  ^°° 

-  lim  E(u2)  -  R-2  Hbj!2  «  R"2  b*  Kb±  (2.9 


where  X  and  K  satisfy 


X  -  X+P  ,  PAT  +  AP  -  pmtv_1mp  +  dwdt 
K  -  KXK  ,  KA  +  ATK  -  KBR_1BTK  +  CTQC 
G  -  -r_1btk,  x(a+bg)t  +  (A+BG)TX  +  fvft 


(2.12 


T  -1 
F  -  PM  V 


These  "output  costs"  l/^  and  "input  costs"  provided  by  (2.9)  show  the 
actual  contributions  of  y^  and  u^  as  opposed  to  the  desired  contributions 


o  and 


These  results  now  allow  a  precise  statement  of  the  Q,  R  selection 
problem.  Note  that  the  weights  Q  and  R  appear  in  the  cost  (1.1)  which  is 
to  be  minimized  to  obtain  the  linear  controller  (2.2),  but  the  weights 
do  not  appear  in  the  cost  (2.6)-(2.8)  used  for  evaluation  of  the  controller 
Thus,  using  (2.9)-(2.12)  the  LQGcv  problem  reduces  to 

The  LOG  Problem: 

_  cv _ 

Find  Q,  R  such  that 

llc±Hx  —  °±  (2>13a) 

Rii  hMk-  (2-13b) 

where  X  and  K  are  given  by  (2. 10)-(2 . 12) . 

3.0  AN  ITERATIVE  ALGORITHM  FOR  THE  LQC  PROBLEM 

x  cv 

The  parameter  optimization  problem  posed  by  (2.13)  requires  the 

2 

simultaneous  solution  of  (n  +n+2k+2m)  algebraic  equations.  Gradient 
schemes  may  be  developed  for  such  purposes.  However,  we  wish  to  avoid 
gradient  schemes  and  the  attendant  numerical  problems.  Instead,  we  intro¬ 
duce  an  iterative  scheme  for  the  selection  of  Q^t  R^  such  that  the 
weighted  output,  input  costs  are  constant.  That  is,  if  l^(k)  denotes  the 
value  of  1/^  on  the  ktb  iteration  of  (^(k)  and  R^k),  then 

l^(k+l)  Q1;L(k+l)  -  l^(k)  Q(k)  for  all  k  (3.1a) 

and  similarly 

l/^(k+l)  Rj^(k+1)  “  l^(k)  R^(k)  for  aH  k  (3.1b) 

Now  if  these  iterations  converge  to  the  maximum  allowable  values 
(l/^(k+l)  +  <j2  and  l/^(k+l)-+-  y2)  on  the  (k+l)St  iteration  then  (3.1)  reduces 
to 

^(k) 

Qii  (k+1)  "  2~  Qii(k)  =  *i(k)  (3.2a) 

°i 

l?(k) 

Rii<k+1)  “  ~2~  Rii(k)  "  V*0  Rii(k)  (3.2b) 

Vi 


where  $(k)  and  i|Kk)  are  non-negative  numbers  [6].  Note  that  the 


constraint  (1.7)  is  satisfied  if  $(k)  _<  1  and  i^(k)  _<  1.  The  proposed 
algorithm  for  Q,  R  selection  is  now  summarized. 


The  LQGcv  Algorithm  for  Q,  R  Selection: 

STEP  I:  Set  k»0,  guess  an  initial  value  for  Q(0),  R(0)  and  solve 

PAT+AP  -  PMTV_1MP  +  DWDT  -  0  ,  F  -  PMTV_1  (3.3) 

STEP  II:  Compute  l/^(k) ,  l^(k)  and  check  for  solution 


Vj(k) 


'i  X(k) 


l^(k)  =  R~*(k)  HbJlJ 


(3. A) 


where  X(k)  ,  K(k)  satisfy  X(k)  -  X(k)  +  P,  K(k)  -  K(k)X(k)K(k) 
and  K(k),  X(k)  satisfy 


0  *=  K(k)A+ATK(k)  -  K(k) BR-1  (k)BXK(k)  +  CTQ(k)C  (3.5) 

0  -  X(k)[A+BG(k)]T  +  [A+BG(k)]  X(k)  +  FVFT  (3.6) 

G(k)  -  R_1(k)BTK(k)  ,  (3.7) 

A  Vi(k) 

-  — = —  <  1  do  not  change  Q 
i  li<w 

«*  — - -  >  1  do  not  change  R^ 

Wi 

If  $^(k)  >  1  change  Q  according  to 

Q11(k+1)  -  *t(k)  Qii(k)  (3.8) 

If  <^(k)  <  1  change  R  according  to 

R  (kfl)  -  *±(k)  Rii(k)  (3.9) 

Set  k=k+l  in  (3.4)-(3.7)  and  return  to  STEP  II  until  ^(k)  _<  1 
for  all  iKl,  ...  k  and  ^^(k)>  1  for  some  i*l,  ...  m. 

STEP  IV:  Change  according  to  (3.8).  If  \J^(k)<  1  change  R  according 

to  (3.9).  Jo  to  STEP  II  until  |\^(k+l)  -  i^(k)  |  _<  e  for 
ij^(k+l)>  1  (i.e.  no  more  change  in  out  of  spec  actuators). 


STEP  III:  If  4>±(k) 
If  *±(k) 


There  are  many  more  choices  for  a  Q  and  R  algorithm  other  than  the 

one  posed  above.  To  date,  this  algorithm  has  produced  the  best  results. 

However,  there  is  currently  no  proof  that  failure  of  this  algorithm  to 

converge  means  that  a  choice  for  Q  and  R  does  not  exist.  Research  is 

continuing  in  this  area.  The  algorithm  does,  however,  attempt  to  use  the 

"general"  (Fig.  1)  nature  of  LQG  theory  to  its  benefit.  Specifically, 

choosing  to  adjust  the  r  's  for  those  actuators  with  <K(k)  <1,  using 

ii  2  1  2 

(3.2b),  reduces  r^  and  should  serve  to  reduce  E^Cy..  }  and  Ew(u^}  for 

actuators  with  ^(k)>  1.  Using  (3.2b)  to  adjust  the  r^'s  on  actuators 

with  \p^(k)>  1  does  not  have  this  double  beneficial  effect.  In  addition, 

once  ip.  (k)  _<  1  for  all  i«l,  ...  k,  using  (3.2a)  to  adjust  q  .  will  reduce 
l  2  11 

q.^  which  should  serve  to  reduce  E  {u,}  for  all  i**l,  ...  m. 

^ii  00  i 

4.0  HOOP-COLUMN  ANTENNA  EXAMPLE 

The  LQGcv  algorithm  has  been  applied  to  a  model  for  a  hoop-column 

antenna  which  has  26  states,  39  measurements,  12  actuators,  and  24  outputs. 

A  detailed  description  of  the  model  is  presented  in  [6].  The  initial 

guesses  for  Q  and  R  were  appropriately  sized  identity  matricies.  Figures 

2,  3,  and  4  are  plots  of  $^(k)  and  ip^(k)  and  reflect  14  iterations  of  the 

algorithm.  The  numbers  appearing  along  the  horizontal  axis  represent  the 

output  or  actuator  number.  The  14  points  associated  with  each  horizontal 

number  are  the  values  of  $^(k)  or  ip^(ic)  for  each  iteration  from  1  to  14. 

As  was  expected.  Figs.  2  and  3  show  that  the  algorithm  forced  all  outputs 

to  be  at  or  below  their  maximum  values  (i.e.  1  or  below  on  the  plot). 

The  fact  that  the  algorithm  tried  but  could  not  drive  all  outputs  to  their 

2 

maximum  allowable  o^  is  mathematically  predictable  but  can  not  be  discussed 

at  this  time.  It  is,  however  directly  related  to  the  sensitivity  of  X 

to  changes  in  q  . 

11  2 

Figure  4  indicates  that  E  {u  }  for  all  actuators  converged  to  levels 

2  °°  i 

above  p  ,  and  the  algorithm  does  not  provide  a  solution.  However,  a 

1  2  2 
solution  is  obtained  if  p^  is  changed  to  the  converged  value  of  E^tu^}. 

As  mentioned  earlier,  the  failure  of  the  LQGcv  algorithm  to  converge  has 

yet  to  be  proven  as  sufficient  for  no  choice  of  Q  and  R  to  exist. 
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Fig.  2  Output  results  for  the  LQGcv  algorithm 
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5.0  CONCLUSION 


This  paper  has  presented  an  algorithm  for  selecting  the  Q  and  R 
weighting  matrices  so  that  the  steady-state  LQG  regulator  operates  within 
specified  RMS  bounds  on  the  input  and  output  variables.  The  algorithm 
is  iterative  and  requires  the  calculation  of  the  steady  state  control 
Ricatti  equation  and  a  steady  state  Lyapunov  equation  of  dimension  n  (i.e. 

#  of  states)  at  each  iteration.  This  is  a  considerable  computational 
savings  when  compared  to  the  requirements  of  a  standard  parameter  optimi¬ 
zation  gradient  technique.  Research  is  continuing  to  determine  if  failure 
of  the  algorithm  to  converge  to  an  appropriate  Q  and  R  is  necessary  and 
sufficient  condition  for  no  choice  of  Q  and  R  to  exist.  As  pointed  out 
by  the  hoop-column  example  of  Section  4,  even  when  the  algorithm  does  not 
determine  the  required  Q  and  R  it  provides  the  RMS  regulator  specifications 
for  which  Q  and  R  weights  have  been  found. 
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ABSTRACT 

This  paper  bridges  a  gap  between  two  known 
results  In  nodel  reduction  theory.  Cost  Equiva¬ 
lent  Realizations  [1]  are  known  to  match  the 
first  two  output  covariances  of  the  full-order 
system,  while  we  will  define  a  Stochastically 
Equivalent  Realization  as  one  that  matches  all 
the  output  covariances  [2].  The  purpose  of  this 
paper  Is  twofold.  First,  we  describe  methods 
for  obtaining  the  minimal  realization  that  mat¬ 
ches  the  first  two  output  covariances;  this  rea¬ 
lization  Is  shown  to  be  a  Cost  Equivalent  Reali¬ 
zation.  Second,  we  describe  a  method  for 
obtaining  a  realization  (not  necessarily  minimal) 
which  matches  any  specified  number  of  output  co- 
variances;  In  the  limit,  we  can  therefore  obtain 
a  Stochastically  Equivalent  Realization.  In 
addition,  certain  generalizations  of  Cost  Equi¬ 
valent  Realizations  are  described  which  apply  to 
infinite  dimensional  systems. 


I.  INTRODUCTION 

Assiane  y(n)  Is  a  stable  stationary  Gaussian 
k-vector  random  process.  The  time  marker  n  Is  an 
Integer  In  (— ,  •).  We  may  consider  y(n)  as  a 
vector  of  Independent  outputs  (rank  C  ■  dim  y(n)) 
generated  by  a  state  space  realization  of  the  form 


x(n+l)  •  Ax(n)  +  Bu(n)  u  r  if  (1) 

y(n)  ■  Cx(n)  y  t  R* 

where  A,  B,  and  C  are  matrices  of  the  appropriate 
dimension.  Assune  u(n)  Is  a  stationary  vector- 
valued  Gaussian  white  noise  process  such  that 


E[u(n)  u*(J)]  - 

nj 


where  Im  Is  the  m  x  m  Identity  matrix,  i  Is  the 
Kronecker  Delta,  and  E  Is  the  expectation. 

Define  *n  as  the  Toeplltz  covariance  matrix 


V  '•  •  Rn* 

R0  •  •  •  Rn-1* 


R„  -  E[y(n*j)  y*  (J)]  (4) 

It  Is  always  assumed  that  RQ  >  0. 

Recall  (for  n  >  0) 

R|)  •  CAn  XC*  where  X  4  E[x(n)  x*(n)]  (5) 

It  Is  well  known  that  X  solves  the  Lyaptaiov  equa¬ 
tion: 

X  •  AXA*  +  BB*  (6) 

Finally,  system  (1),  with  state  covariance  X 
will  be  denoted  by  (A,  B,  C,  X). 

We  now  define  a  "q-COVER"  (q  -  COVarlance 
Equivalent  Realization). 

Definition:  A  state  space  realization  of  the  form 


xp(n+l)  ■  Apxr(n)  +  Bpu(n) 
yr(n)  -  Cr*r(n) 


xr  c  CK 
u  c  R*1  (7) 

yr  c  Rk 


where  u(n)  is  a  white  noise  process  with  identity 
covarumce,  is  a  q-COVER  of  (1)  if 

E[yr(n*J)  yr*(j)]  •  R„  for  n  <  q  (8) 

A  minimal  q-COVER  Is  a  q-COVER  whose  state  space 
dimension  p  Is  minimal  over  the  class  of  all  q- 
COVERs  satisfying  (8).  A  minimal  1-COVER  Is  rela¬ 
tively  easy  to  obtain  and  has  been  done  via  Compo¬ 
nent  Cost  Analysis  (CCA)  [1].  A  minimal  —COVER  Is 
the  minimal  Stochastically  Equivalent  Realization 
(SER).  In  general,  however,  the  task  of  finding  a 
minimal  q-COVER  for  1  <  q  <  »  is  still  an  unsolved 
partial  realization  problem  [3].  The  purpose  of 
this  paper  is  to  describe  methods  for  obtaining  the 
minimal  1-COVER,  and  to  show  that  It  is  the  Cost 
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Equivalent  Realization  (CER)  In  [1].  For  complete¬ 
ness  we  also  describe  a  method  for  flndlna  a  gene¬ 
ral  q-COVER  (not  necessarily  minimal)  of  (1)  for 
1  <  q  <  -.  This  Is  essentially  a  generalization  of 
the  CEX. 

We  feel  these  methods  are  valuable  for  a  num¬ 
ber  of  reasons.  First,  they  produce  a  CER  much 
faster  than  the  method  In  [1].  Second,  the 
approach  Is  much  more  general  than  previous  methods 
and  can  be  applied  to  Infinite-dimensional  systems. 
Third,  a  q-COVER  (q  »  1)  will  provide  a  more  faith¬ 
ful  model  than  a  CER.  Finally,  this  problem  Is 
really  a  special  case  of  the  more  general  Partial 
Realization  Problem. 


II.  MINIMAL  1-COVER:  OIRECT  METHOD 

In  this  section  we  obtain  the  minimal  1-COVER 
for  any  system  (A,  B,  C,  X).  Assume  Rq  and  Rj  are 
the  first  two  output  covariances  of  (A,  B,  C,  X) 
which  we  wish  to  match  with  a  reduced  system  (Ar, 
Br.  Cr.  Xr). 


To  begin,  define  Cr  as  the  positive  definite 
squere  root  of  R_.  Using  the  fact  that  R^*  >  0, 
we  have 


where  Ar  6  C,.*1  RiC-*'1 .  It  can  easily  be  shown  that 
Equation  (9)  Implies  that  (lArll  <  1  or,  equiva¬ 
lently,  that  (I  -  A^r*)  >  0.  We  can  therefore 
define  8r  as  the  positive  square  root  of  (I-ArAr*). 

We  claim  that  (Ar.  Br,  Cr,  Xr)  Is  a  minimal 
1 -COVER  of  (A.  B.  C,  X).  To  show  this,  note  that 
Xr  ■  I,  and  therefore 

*or  ‘  CrV  “  R0  (10) 

Rlp  -  CrArCr*  -  WVr*'1)^*  *  *1 

The  system  (Ar.  Br,  Cr,  I)  Is  therefore  a  1-COVER. 
It  is  minimal  because  the  state  and  output  vectors 
are  of  the  same  dimension.  We  can  now  restate  the 
above  as 

Theorem  1 :  The  eyetem  (Ar,  Br,  Cr,  I)  with  paraim- 
t ere  defined  by 

Cr  -  *(Ro)1/2  V  "  Cr'1*!*/'1  Bf-  ♦(I-ArAp*)1/Z 

(ID 

ie  a  minimal  1-COVER  of  (A,  B,  C,  X). 

Now  consider  a  different  minimal  1-COVER  of 
(A,  B,  C.  X)  denoted  by  (X,  X,  Z,  X)-  Since  both 
(A_,  Br,  Cr,  I)  and  (A,  X,  X,  I)  are  minimal 
1 -COVERS  of  (A,  B,  C,  X)  they  must  be  of  the  same 
order.  We  say  that  any  two  systems  of  the  same 
order  are  "equivalent*  If  there  exists  a  nonsingu¬ 
lar  transformation  P  such  that 

C,.  •  Zp  Ar  -  P_1XP  (12) 


and 

E[y  (n+j)y  *(J)]  «  E[yr(n-t-j)yr*(J)]  for  all  n,J 

(13) 

If  P  Is  a  unitary  matrix,  we  say  that  (Ar,  Br,  Cr.D 
and  (X,  X,  X,  X)  are  "unltarlly  equivalent". 

Theorem  S:  All  minimal  1 -COVERS  are  equivalent. 

Proof:  (The  proof  Is  similar  to  well  known  argu¬ 
ments  In  [4]  concerning  equivalent  systems. 

For  completeness  It  Is  provided  here.) 

Assume  that  (AP,  Br,  Cr,  I)  and  (X,  X,  X,  X) 
are  both  minimal  1-COVERs  of  (A,  B,  C,  X),  We  will 
show  that  a  nonsingular  P  exists  to  satisfy  (12) 
and  (13). 

Now,  since  (Ar.  Br,  Cr,  I)  and  (I,  X,  X,  X) 
both  have  the  same  Rq  and  R]  by  hypothesis,  using 
(8)  and  the  definition  X  •  E[x(n)  x*(n)]  yields 

R  ■  C  C  *  *  CXX*  R,-C  A_C  *  •  CXXX* 
o  r  r  i  r  r  r 

(14) 

Let  X  be  the  positive  square  root  of  X  »  0.  Then 
the  first  equation  In  (14)  yields 

II  Cr*f  |f  -  ||  X"X*f  ||2  ftfk  05)' 

This  Implies  that  there  exists  a  unitary  operator  u 
on  fk  such  that  Cr*  •  U*X*X*.  Hence  C_  -  CPU.  Let¬ 
ting  P  ■  Xu  alves  Cr  •  XP,  as  required  In  the  first 
equation  In  (12).  Since,  by  hypothesis,  R1  •Rl»X|, 

CrArCf*  «  CXXX*  -  CXPX*X*  »(CpP"1  )XPUU*X*X* 

•  CrP'^PP-X*  -  CrP^XPCr*  (16) 

Since  Cr  Is  Invertible,  this  leads  directly  to  Ar  ■ 
P''XP  as  required  In  (12).  Proof  of  (13)  follows 
from 

£[yr(n)yr*(0)]  -  CrArnCr*  -  XPArnP*X* 

-  XPArV1PP*X* 

-  CX*1  pp*X*  «  CXVuu*?*?*  «  Ca°PP*c* 

-  ZFxZ*  -  E[y(n)y*(0)]  (17) 

where  n  >  0.  A  similar  argument  holds  for  n  <  0. 
This  completes  the  proof. 

Reiiark  1 .  By  Theorem  1,  If  X  ■  I  then  the  operator 
P  In  (12)  Is  unitary.  This  follows  from  the  proof 
where 

PX*  •  X  •  I  (18) 

Hence,  (AP,  Br,  Cr,  I)  and  (X,  X,  Z,  X)  are  unlta¬ 
rlly  equivalent. 

Remark  2.  Assume,  as  before,  that  Co  >  0.  Follow- 
Ing  15J  •  random  process  y(n)  Is  Markov  If  y(n) 
adults  a  representation  of  the  form  (5)  such  that 
the  dimension  of  x(n)  equals  the  dimension  of  y(n). 
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Theorem  2  shows  that  y(n)  •  yr(n)  when  y(n)  Is  Mar¬ 
kov.  Notice  the  operator  P  in  (12)  does  not  inter¬ 
twine  the  operators  Br  and  J  In  (AP,  8r,  Cr.  1)  and 
(K,  ff,  Z,  T).  This  Is  not  unusual  for  stochastic 
systems  [5]. 

Remark  3.  Theorem  2  cannot  be  generalized  for  q- 
COVERs  where  q  >  1. 


III.  MINIMAL  1-COVER:  PROJECTION  WTWOD 

Given  an  Initial  state  space  realization  (A,  B. 
C.  X),  we  wish  to  obtain  a  minimal  1-COVER  directly 
from  It.  The  approach  In  this  section  Is  equivalent 
to  the  method  In  [1]  of  obtaining  a  Cost  Equivalent 
Realization  of  (A,  B.  C.  X). 

Consider  a  system  (A,  B.  C.  X).  Without  loss 
of  generality,  assune  X  ■  In._  (This  can  always  be 
done  by  defining  a  new  state  x  ■  Px  through  the 
appropriate  transformation  P.)  Now  define  H  as  the 
range  space  of  C*  by 

H  •  HMQ  (19) 


where  N-1-  denotes  the  nullspace  perpendicular. 

Define  the  following  operators,  A  on  H  and  Cj:  H-»Rk 
by 

A,  4  P„  A|H  C,  *  C | H  (20) 

where  PH  denotes  the  orthogonal  projection  onto  the 
subspace  H,  and  the  symbol  "|"  means  restricted  to. 

Now  define  Bi  as  the  positive  square  root  of 
(I  -  A-| Ati *) .  Note  that  B,  Is  well  defined  since 

the  Lyapunov  equation  (6)  with  X]  «  I  shows  that 
( I— A) A-| ■*)  >  0.  Obviously,  with  y,  ■  C]Xi,  we  have 

Ro  -  CC*-C,C1*  -  ECyi (n)y, *(n)]  (21) 

R,  -  CAC*«C, A, C, *  -  E[yj(n+1  )y,*(n)]  (22) 

Therefore  (A],  B, ,  C,,  I)  Is  a  minimal  1-COVER  of 
(A,  B,  C,  I).  By  Theorem  2,  It  Is  unltarlly  equiva¬ 
lent  to  (AP,  Br,  Cr,  1)  found  In  Section  11. 

We  will  now  demonstrate  this  approach  with  the 
following  generic  example: 


Example: 

STnce  every  state  space  realization  cam  be  put 
Into  a  Generalized  Hessenberg  form  [6],  th«  trans¬ 
formed  to  coordinates  where  X«l,  we  will  assiene. 


A  • 


no 

loss  of  generality,  a  system  of  the  form 

B.  C 

,  I)  where 

*11 

*12 

*13  •  • 

*ln 

*21 

*22 

*23  *  * 

*2n 

x  *  C* 

0 

*32 

*33  •  * 

u  c  rT  (23) 

0 

0 

*43  ‘  • 

y  c  Rk 

0 

0 

0  .  . 

*nn_ 

C  -  [Ck  0 . 0] 

and  B  Is  the  positive  square  root  of  I-AA*  to  satis¬ 
fy  (6),  with  X  -  1. 

Now,  using  the  projection  approach  in  this 
example  yields 

H  •  N-m  «  {[Rk  0 . 0])*  (24) 

A1-PH(A|H)«A11  C1-C|H-Ck  B^+d-A^*)172  (25) 
Obviously 

tty,  (»>>>! *(nn  -  R0  Ety^n+Dy^tn)]  -  R,  (26) 

The  system  (Ai,  Bi,  Ci.  I)  of  dimension  k  Is  there¬ 
fore  a  minimal  1-fcOVER  of  (A,  B,  C,  I). 

NOTE:  The  method  used  In  Section  II  only 
required  R0  and  R, ;  no  Initial  state  space  realiza¬ 
tion  was  required  to  obtain  a  minimal  1 -COVER.  How¬ 
ever,  the  projection  method  above  (or  equivalently 
In  [1]),  requires  an  Initial  state  space  realization 
(A,  B,  C,  X)  to  obtain  the  minimal  1-COVER.  The 
results  are  Identical. 

Now,  compare  the  methods  In  Sections  II  and  III 
assuming  both  start  with  an  Initial  state  space 
representation  (A,  B,  C,  X).  The  method  In  Section 
II  for  obtaining  a  minimal  1-COVER  Is  much  faster 
than  the  projection  method  or,  equivalently,  that  In 
[1].  Both  the  projection  method  and  [1]  require  the 
original  ntb  order  system  (A,  B,  C,  X)  to  be  trans¬ 
formed  to(A',B',  C',  I).  This  requires  finding  an 
n-dlmenslonal  transformation  xV2  and  Its  n- 
dl  mens  Iona!  Inverse.  The  method  In  SectlonTi  allows 
one  to  go  Irnnedlately  to  the  smaller  k  (output)- 
dlmenslonal  space.  In  which  one  must  then  only  com¬ 
pute  Cr  and  Its  k-dlmenslonal  Inverse.  Since  always 
k  <  n,  and  In  many  cases  k  <  *  n.  It  Is  computation¬ 
ally  much  faster  to  obtain  a  1 -COVER  by  the  method 
of  Section  II  than  by  the  projection  method  and  [1]. 


IV.  q-COVER  BY  PROJECTION 

Given  a  system  (A,  B,  C,  I)  we  wish  to  construct 
a  realization  (A„,  B„,  Cq,  I)  of  lower  order  that 
matches  up  through  tne  qth  covariance  of  the  origi¬ 
nal  system.  This  can  be  done  In  two  different  ways. 
First,  by  using  the  results  In  [7]  and  elsewhere 
with  Rg  ...  Rg,  one  can  obtain  the  well  known  maxi¬ 
mal  entropy  realization  of  Rq,  which  Is  a  q-COVER. 

In  fact,  the  1-COVER  Is  precisely  the  maximal  entro¬ 
py  realization  of  Rg,  R,  (l.e.,  the  rest  of  the 
Schur  parameters  are  zero).  The 

other  approach,  which  we  wish  to  pursue  here.  Is  a 
generalization  of  the  projection  approach  used  In 
the  previous  section.  He  expect  the  more  general 
q-COVER  obtained  by  projection  to  provide  a  better 
reduced  order  realization  of  (A,  B,  C,  I)  than 
either  the  1-COVER  or  the  max  liman  entropy  realiza¬ 
tion. 


Consider  a  system  (A,  B,  C,  I).  Define  the 
subspace  H-c  by 

H„  -  X  A’h  (27) 


1*0 


C4 


where  H,  the  range  space  of  C*.  Is  defined  In  (19). 
The  subscript  "qc"  denotes  that  we  are  obtaining  a 
realization  that  Hatches  up  through  the  qth  covari¬ 
ance  by  defining  Hqr  as  part  of  the  "q-controllabl- 
llty"  subspace  of  (A,  C*). 

Now  define  the  following  operators.  Aac  on  Hae 
and  Cqc:  Hqc  *  Rk  by  M 

V‘V‘IV  v*c|v  U8) 

Noting  that  ||  A--  ||<  ||A  ||  <  1 ,  we  can  therefore 
define  “ 

Bqc  *  ♦»  '  Vqc*)V2  (29) 

We  claim  that  (AqC.  BqC,  CqC>  I)  Is  a  realiza¬ 
tion  (not  necessarily  minimal)  which  matches  up 
through  the  qth  covariance  of  (A.  B,  C,  I).  To  see 
this,  consider  the  case  q  *  3.  Using  (6)  with  X*I 
yields 

1*3  «  CA3C*  (30) 

Now,  using  the  fact  that  Hjcc  H2cc  Hjc.  and,  from 
(27)  that 

A  H  C  k  ■  1 ,2,. (31) 

we  can  see  that 

R,  -  CPU  (A3C*)  -  CPH  AP„  (A2C*) 

3  "3c  H3c  H3c 

"  ""wS/V**  ‘  Vfc'V**:  (32> 

Similarly,  we  can  show  that  R£  ■  R2C,  Ri-Rlq,  Ro'Roe- 
'■to  therefore  see  that  (A3C,  63c*  ^30*  1*  • 

3-COVER  for  (A,  B,  C,  I).  For  arbitrary  q.  the 
proof  Is  analogous. 


We  could  have  obtained  a  q-COVER  by  defining 

•  'v*'“  <33> 

where  Hgo  is  part  of  the  "q-observabll 1ty“  subspace 
of  (A,CJ.  We  then  would  have  used  H„0  In  place  of 
HgC  In  (28).  The  resulting  q-COVER  (%,o.  Bqo.  Cqo« 
i;  could  be  of  either  larger  or  smaller  dimension 
than  (Aqq.  Bqc,  CqC,  I). 


To  assure  that  one  obtains  the  smallest  dimen¬ 
sion  q-COVER  possible  by  this  method  (though  not 
necessarily  minimal),  one  should  take  the  q-COVER 
(Aqr.  Bgr.  C„c,  I)  obtained  Initially  and  use  this 
realization  to  project  onto  Its  “q-observabll  Ity" 
subspace.  In  other  words,  after  obtaining  (AqC, 
Bqc*  Cqc*  H  *e  form 
q-1  m 

-  v  Aj  H  (34) 


We  can  then  obtain 


Vo-  PHqcoVlHqco  Vo'CqdHqco  <3B> 
Bqcc  *qco*qco*^ 

Therefore  (AqC0,  Bqco,  CqC0,  I)  is  a  q-COVER  for 


(A,  B,  C,  I)  of  dimension  less  than  or  equal  to  the 
dimension  of  both  (AqC,  Bqc,  Cqc,  I)  and  (Ap0,  Bqo, 
Cqo»  !)•  1 

The  example  In  Section  III  can  be  easily  exten¬ 
ded  to  demonstrate  this  method.  Oue  to  lack  of 
space,  however,  we  are  not  able  to  Include  that 
demonstration  In  this  paper. 

NOTE  1:  If,  at  any  time,  we  find 

Vl)co-Hqco  (37) 

then,  obviously  HqC0  ■  H~co-  In  particular,  the 
Cayley-Hamll ton  Theorem  Implies  that  once  q  »  n.(37) 
will  automatically  hold.  When  (37)  does  apply,  then 
the  q-COVER  Is  an  —COVER,  and  Is  therefore  an  SER. 
For  the  smallest  q  satisfying  (37),  the  applicable 
(Aqco»  8qco»-£qco •  ^  Is  a  Minimal  SER  of  Rn. 

NOTE  2:  The  maximal  entropy  approach  will  always 
yield  a  q-COVER  of  order  q,  even  when  q  >  n,  while 
our  approach  often  gives  a  lower  order  q-COVER. 


V.  CONCLUSION 

A  q-COVER  (q  COVarlance  Equivalent  Realization) 
has  been  defined  as  a  realization  that  matches  up 
through  the  Rq  covariance  of  a  system  or  process. 

Two  methods  have  been  developed  for  obtaining  a  mini¬ 
mal  1-COVER.  One  only  requires  Rg  and  R^ ,  while  the 
other  requires  an  Initial  state  space  representation 
and  Is  essentially  equivalent  to  the  Cost  Equivalent 
Realization  method  In  [1].  Finally,  a  projection 
method  was  described  which  can  be  used  to  obtain  a 
q-COVER  when  q  -  2,3,...«.  When  q  -  -,  the  —COVER 
Is  a  Stochastically  Equivalent  Realization. 
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ABSTRACT 

Component  Cost  Analysis  [1]  is  used  to  develop  a  method  for  control¬ 
ler  reduction.  The  reduction  of  the  controller  is  based  upon  the  parti¬ 
cipation  of  the  controller  states  in  the  value  of  a  quadratic  performance 
metric.  The  controller  states  which  have  the  smallest  contribution  to 
the  performance  metric  are  truncated  to  produce  the  reduced  controllers. 
An  error  index  is  defined  to  evaluate  the  reduced  controller  compared 
with^optimal  LQG  controller,  and  bounds  on  this  index  are  derived.  A 
numerical  example  is  included  to  illustrate  the  procedure. 


I.  INTRODUCTION 


The  straightforward  application  of  Linear-Quadratic-Gaussian  (LQG) 
theory  in  practical  applications  is  hindered  by  these  limitations: 

(i)  The  plant  model  which  is  accurate  enough  to  serve  in  the 
evaluation  of  candidate  controllers  is  often  too  complex 
for  direct  LQG  computations  (beyond  "Riccati -solvable" 
dimension). 

(ii)  A  traditional  approach  to  synthesize  a  controller  of  a 
specified  order  is  to  first  apply  (one's  favorite)  model 
reduction  theory  to  obtain  a  low  order  model  (compatible 
with  on-line  controller  hardware/software  limitations), 
and  then  to  apply  optimal  control  and  estimation  theory 
to  obtain  a  LQG  controller  which  is  optimal  for  the 
reduced  model.  However,  this  optimization  is  based  upon 
a  reduced  order  model  which  was  guaranteed  "close"  to 
the  plant  only  in  the  open- loop.  Since  the  control 
inputs  can  drastically  affect  the  behavior  of  the  system 
(and  the  quality  of  the  reduced  model),  reliable  model 
reduction  cannot  be  performed  without  some  knowledge 
of  the  inputs.  But  since  the  control  inputs  are  yet 
to  be  determined,  this  is  tantamount  to  admitting  that 
the  modeling  problem  and  the  control  problem  are  not 
independent  problems }  as  was  pointed  out  in  [lb]. 

Component  Cost  Analysis  (CCA)  [1-3J  was  introduced  as  an  attempt 
to  unify  the  model  reduction  and  control  design  problems.  In  the 
open-loop  model  reduction  versions  of  the  CCA  theory,  the  state 
dependent  term  in  the  quadratic  performance  measure  (which  is  intended 
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for  use  in  later  control  design)  is  used  in  the  model  reduction  decisions. 
In  this  way,  the  modeling  and  control  problems  were  "integrated"  in  [1-3]. 
In  these  references,  the  following  LQG  control  design  strategy  is  sug¬ 
gested  to  obtain  a  controller  of  order  nc,  beginning  with  a  model  of 
order  n>>nc,  where  n  is  too  large  for  solutions  of  Riccati  equations. 

Suboptimal  LQG  Design  Strategy 

(a)  Apply  open-loop  CCA  model  reduction  methods  [1-3]  to  the 
high-order  evaluation  model  of  order  n.  Reduce  the  order 
from  n  to  nr,  where  nr  is  the  "Riccati-sol vable"  dimension 
of  the  local  offline  computer. 

(b)  Solve  for  the  optimal  LQG  controller  for  this  model  of 
order  nr.  This  yields  a  controller  of  order  nr,  where 

V  V  n- 

(c)  Apply  controller  reduction  CCA  methods  to  reduce  the 

controller  from  order  n_  to  order  n  <n  . 

r  c  r 

It  is  emphasized  that  model  reduction  [step  (a)]  and  controller-reduction 

[step  (c)]  are  different  mathematical  procedures. 

The  intended  advantage  of  this  strategy  over  the  traditional  approach 

(ii)  [which  sets  nr  *  nc  and  skips  step  (c)]  is  that  more  information 

about  the  higher  order  system  and  its  would-be  optimal  controller  is 

made  available  for  the  design  of  the  reduced  controller. 

The  purpose  of  this  paper  is  to  present  a  reliable  controller 

reduction  scheme  to  accomplish  step  (c)  for  infinite  time,  stationary 

LQG  problems.  This  paper  assumes  that  a  reliable  reduced  model  of 

order  nf  is  available.  We  will  denote  this  model,  by  S(nr).  Let  this 

reduced  model  of  order  n  be 

r 


i 
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tx  =  Ax+Bu+Dw 

y  =  Cx  0  -1 ) 

z  *  Mx+v 

where  xeR  r,  yeR^,  ueR01  and  zeR*'.  The  disturbances  weR^  and  veR*  are 
assumed  to  be  uncorrelated  zero-mean  white  noise  processes  with  inten¬ 
sities  W  >0  and  V>0  respectively,  and  under  the  standard  assumptions 
the  matrix  pairs  {C,A}  and  {M,A}  are  observable  and  the  matrix  pairs 
{A,B}  and  { A, D>  are  controllable.  The  vector  z  is  composed  of  the 
measurements  corrupted  by  the  noise  v.  The  vector  v  contains  only 
the  variables  which  are  used  to  measure  the  performance  of  the  system 
via  a  cost  function  1/  defined  as  follows 


V  =  lim  E{l/(t)> 

t-H» 


(1.2a) 


where 

‘'(t)  -  ||y(t)|!g  +  Ilu(t)|||  (i.2b) 

2  T 

The  notation  ||y||g  denotes  y  Qy,  and  Q>  0  and  R>  0  are  weighting  matrices. 
E  denotes  the  expectation  operator. 

Step  (b)  of  the  Suboptimal  LQG  Design  Strategy  is  to  obtain  the 
optimal  controller  [which  minimizes  the  cost  function  V,  in  (1.2)]  for 
(1.1).  This  controller,  denoted  by  Sc(nr),  is  given  by  [4]. 

I  x  =  Acx  +  Fz  n 

Sc(nr):  j  ,  xcR  r  (1.3a) 

(u  =  Gx 


where 


(1.3b) 


Ac  5  A+BG-FM  ,  G  ■  -R  'ST;  F  =  PM'V'1 

KA+ATK  -  KBR_1BTK  +  CTQC  =  0 
PAT  +  AP  -  PMTV_1MP  +  DWDT  =  0 

Since  the  controller  (1.3)  is  of  order  nr>nc»  a  reduction  of  this 
controller  (1.3)  to  order  n£  is  now  required.  [Step  (c)  of  the 
strategy]. 

Past  Approaches 

Since  the  original  version  of  this  paper  was  submitted,  Verriest 

[5,6]  has  proposed  to  select  a  set  of  coordinates,  named  "LQG-balanced" 

coordinates,  in  which  the  Riccati  solutions  K  and  P  of  (1.3c)  and  (1.3d) 

are  'balanced'  in  the  sense,  K  =  P  =  n  =  diag  {ir^ ,  ^ . Trn  },  and 

r 

to  delete  those  n  -n  controller  states  that  have  the  smallest  tt,.-s, 
yielding  a  reduced  controller  of  order  nc.  These  deleted  states  are 
interpreted  as  those  states  that  are  estimated  with  the  least  uncertainty 
(measured  by  of  P)  and  have  the  least  contribution  to  a  'fictitious' 
cost  function  (measured  by  of  K).  This  'fictitious'  cost  function 
is  evaluated  by  assuming  that  all  the  plant  states  are  available  for 
feedback.  But,  when  the  plant  states  are  not  available,  one  could 
feedback  only  the  estimates  [which  is  precisely  why  an  estimator  (1.3a) 
is  constructed],  and  the  contribution  of  the  states  to  the  actual  cost 
function  l /  may  be  quite  different  from  it.. ,  as  can  be  seen  from  Eq.  11 
of  [6]. 

The  work  of  Kosut  [7]  and  Wenk  and  Knapp  [8]  should  also  be 
mentioned  since  they  also  deal  with  closed- loop  methods  for  controller 


(1.3c) 

(1.3d) 
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simplification.  However,  since  they  treat  parameter  optimization 
approaches,  their  work  is  along  totally  different  lines  than  the 
work  herein. 

We  consider  the  contribution  of  these  controller  states  to  the 
cost  function  (1.2),  and  delete  those  controller  states  having  the 
smallest  contribution  to  the  cost  function  (1.2).  Hence  the  proposed 
controller  reduction  algorithm  follows  the  following  three  steps. 

Contro l ler-redtuction-a Igorithm 

1.  Select  a  suitable  basis  for  the  controller.  The  contribution 
of  each  of  the  controller  states  to  the  cost  function  (1.2) 
is  precisely  measured  by  a  metric  called  the  'component 
cost',  5^(x);  i=l,  2,  ....  nf.  Compute  the  component  costs. 

2.  Rank  the  controller  states  so  that 

l ^  (x)  >_  Vz(x)  >  •  •  •  >_  Un  (x)  . 

r 

3.  Delete  the  last  n  -n„  states  to  obtain  the  reduced  controller. 

r  c 

Of  course,  this  algorithm  is  similar  to  the  "CCA  Model  Reduction 
Algorithm"  given  in  [2],  where  plant- states  x..  are  considered  for 
truncation,  unlike  the  above  controller-reduction  algorithm  where  the 
controller  states  x..  are  the  only  candidates  for  truncation.  Both 
these  algorithms  use  the  basic  concepts  of  Component  Cost  Analysis  [1], 

The  organization  of  the  paper  is  as  follows.  Section  II  presents 
some  preliminary  results  pertaining  to  controller-reduction  schemes  to 
aid  in  the  evaluation  of  reduced  controllers,  and  develops  a  'controller- 
error-index'  for  this  purpose.  The  main  contribution  of  this  paper 
is  in  Section  III  where  the  concepts  of  CCA  are  used  to  systematically 
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develop  a  set  of  coordinates  to  be  used  for  controller  reduction  studies. 
Reduced  controllers  obtained  in  this  representation  are  evaluated  with 
respect  to  the  performance  of  the  system  (1.1).  A  Solar  Optical  Telescope 
is  considered  as  a  numerical  example  in  Section  IV  to  illustrate  the 
proposed  method.  A  comparison  of  these  controllers  with  those  obtained 
by  the  LQG-balanced  method  [5,6]  is  also  made.  The  numerical  scheme 
for  the  computation  of  the  above  mentioned  coordinates  is  presented  in 
Appendix  A. 

II.  PRELIMINARIES  TO  CONTROLLER  REDUCTION 

Let  the  model  S(nr)  in  (1.1)  be  partitioned  as  follows: 


[C 


ar  art 

XR 

JHr  *t  _ 

_XT_ 

V 

+ 

u  + 

_bt_ 

_°I. 

(2.1) 


z  =  [Mr  Mt] 


+  v 


c  t 

where  xDe R  and  xTe R  ,  t+n  =n  ,  with  all  the  matrices  appropriately 

K  i  c  r 

dimensioned.  The  corresponding  partitioned  form  of  the  controller  Sc(nr) 
in  (1.3a)  is 


XR 

Ac 

CR 

AC 

lrt 

~xr" 

~V 

* 

* 

+ 

_XT_ 

Ac 

°TR 

o 
— i 

1 _ 

_XT_ 

[gr  gt] 


u 


(2.2) 
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where  xReR  c  (note  that,  AQ  =  AR+BRGR-FRMR) .  Now  augmenting  (2.1) 

R 

and  (2.2)  together,  the  closed  loop  system  can  be  written  as 
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(2.4d) 


Now  assume  that  a  reduced  controller  of  order  n.  denoted  by  S  (n_) 

t  c  c 

is  obtained  from  (2.2)  by  deleting  Xy,  to  yield. 


Sc(nc): 


p  =  Aj.  p  +  F^z  ,  pcR 
R 


UR  =  GR° 


(2.5) 


The  rest  of  this  section  devotes  attention  to  the  evaluation  of  this 
controller  by  considering  the  value  of  the  cost  function  (1.2)  when 
S(nr)  of  (2.1)  is  driven  by  (2.5).  For  the  convenience  of  evaluation, 
S(n  )  and  S  (n  )  are  augmented  to  obtain 

«  V  w 


XR  *  *RXR  +  ^R  ^  *  XRe^ 
yK  =  CRXR 

A 

which  results  from  the  truncation  of  Xy  from  (2.3). 
where 

*RT  5  WW 

and 

yR  =  ^RT’  ur'^  ' 


(2.6) 


It  is  easy  to  verify  that  {AR,  PR,  CR}  satisfy  the  definitions  in  (2.4b, 
c,  d),  and  that  the  cost  function  for  the  evaluation  of  (2.6)  is 


=  lim  E{Mt)} 


(2.7a) 


where 

^(t)  *  ||ys(t)||2+  ||uR(t)|||  . 

The  cost  functions  V  and  are  computed  by  [4] 

V  =  Tr[CTacx] 

"r  -  T^r^r^r! 
with  X  and  X^  satisfying 

XAT  +  AX  +  WVJ  =  0 

Vj  +  Vr  +  \WR  ’  0 


(2.7b) 

(2.8a) 

(2.8b) 


(2.9a) 

(2.9b) 


where 


A  = 

ar  art 

<11 

* 

V 

_atr  ^  _ 

X 

c  =  [cR  cT] 


(2.9c) 


0 

R 


The  following  definitions  and  lenmas  will  prove  useful  in  the 
subsequent  development  of  the  controller  reduction  algorithm  proposed 
herein.  (The  associated  proofs  are  given  in  Appendix  B). 

Definitions: 

2.  The  Predicted  Controller  Index  l(nr»nc)  is  defined  by 

I(nr.nc)  -  l  (!/R-l/)  (2.10a) 


where 
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'  Tr^CRT^CR^R^  +  Tr^CR  aCT*RT^ 


(2.10b) 


and  where  XR  and  XRy  are  obtained  from  partitioning  X  as 


XR  XRT 


XRT  XT 


2.  The  Controller  Error  Index  I (nr ,nc )  is  defined  as 


HVnc>  ■  7  'V1" 


(2.11) 


In  the  definitions  above,  the  arguments  nr  and  nc 
indicate  the  order  of  the  plant  and  the  controller 
respectively. 


Lemma  1: 


1.  The  error  indices  l(nr,nc)  and  I(nr,nc)  satisfy  the 


following 


(i)  I(nr,nc)  =  -  £  (TrCc/aC^y]  +  Tr[CRT£CTxJT])  (2.12a) 


(ii)  Knr,nc)  =  7(Tr[CRTaCR(XR-XR)]-Tr[CTTaCTXT] 


ZTrCcJsc^1]) 


(2.12b) 


where  (XR-XR)  satisfies 

T 


(xr-xr)ar  +  Vxr“xr}  -  u  m  0 

and  where 

u  =  [art£rtt  +  xrtartt]  . 


(2.12c) 


(2-1 2d) 
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(iii)  max  {0,  y  (nc )  Xm  (XR)  - 1 }  <_  I(nr,nc)  <_  max  {0,u(nc)xM(XR)-l } 

(2.13a) 

where  Xm(*)  [x^*)]  denotes  the  minimum  [ maximum ]  eigen¬ 
value  of  the  matrix  (*)»  cmd 

u(nc)  =^Tr[CRTaCR3  (2.13b) 

where  the  argument  nc  is  used  to  denote  the  dependence 
of  v  on  nc- 

The  predicted  controller  error  index  is  exact  l(nr,nc)  * 

7(nr*nc),  under  any  of  the  following  conditions: 

(i)  necessary  and  sufficient  condition  . 

Tr[CRTaCR(XR-XR)]-Tr[CRTaCTXRTT]  *  0  (2.14a) 


( i i )  sufficient  conditions 

a)  Xy  is  unobservable  in  u  in  Sc(nr)  (2.14b) 

b)  Xy  is  uncontrollable  from  z  in  Sc(nr)  (2.14c) 

c)  XRy  *  0  with  Ar  stable  (2.1 4d) 

The  controller  error  index  l(nr,nc)  is  zero  under  any  of  the 
following  conditions 

i )  necessary  and  sufficient  condition 

Tr[CRQCR(XR-XR)]  -  Tr[cj()CyXy]  -  2TricJoCyXjy]  =  0 

(2.15a) 

ii)  sufficient  conditions 

a)  Xj  is  unobservable  in  u  in  S^ir >r)  (2.15b) 

b)  Xy  is  undisturbable  from  z  in  ^(n^)  (2.15c) 


III.  CONTROLLER  REDUCTION  BY  COMPONENT  COST  ANALYSIS 


The  idea  of  Component  Cost  Analysis  (CCA),  [1-3]  is  to  determine 
the  significance  of  each  'component'  (in  this  paper,  these  'components' 
are  the  individual  controller  states  )  by  assigning  a  metric,  called 
'component-cost',  to  each  component.  Then,  a  reduced  order  controller 
is  obtained  by  deleting  those  controller  states  that  have  the  smallest 
component  costs.  These  ideas  are  extended  to  controller-reductions 
as  follows. 

From  the  structure  of  C  in  (2.4d)  and  Q.  in  (2.9c),  and  by 
partitioning  X  as 

X  ^  lim  E{x(t)xT(t)} 

t-~  (3.1) 

X  =  lim  E{x(t)xT(t)} 
t-~ 


the  expression  for  the  cost  functions  V  in  (2.8a)  can  be  rewritten  as 


V  =  Tr[CTQCX]  +  Tr[GTRGX]  .  (3.2) 

T 

We  note  that  the  'Control  Cost'  (Tr[G  RGX] )  may  be  decomposed  into 
contributions  from  each  controller  state 


such  that 


^  (x) 


A  1 
=  7 


lim  E  {  x.} 

t-*»  8X^ 


(3.3a) 


l/(x)  =  lim  E|ju(t)i|p 
t-x» 


n^ 

=  Tr[GTRGX]  =  £  I /.(x)  .  (3.3b) 
i=l  1 


The  'Regulation  Cost'  V(x)  is  the  remaining  term  in  (3.2) 


VrWlvtvV 


(3.3c) 


so  that 


V(x)  -  Tim  E||y(t)|L 


V  =U(x)  +  V(x)  . 


=  Tr[C  QCXJ 


(3.3d) 


As  a  passing  remark  we  mention  that  when  Sc(nr)  is  optimal  LQG- controller, 
we  have  =  X  and  X  =  P+X,  with  P  satisfying  (1.3d)  and  X  satisfying 
[4] 


X(A+BG) ‘  +  (A+BG)X  +  FVFT  =  0  . 


(3.3  ) 


It  follows  from  the  definition  (3.3a)  and  the  derivations  in  [2]  that 


=  [GTRGX]..  ,  i  -  1,  2 . nr 


(3.4) 


According  to  the  CCA  theory  the  controller  states  x^  associated  with 
the  smallest  values  of  l^(x)  are  deleted  from  the  controller. 

Now,  the  optimal  controller  Sc(nr)  may  not  be  minimal  (in  the 
sense  of  Kalman),  even  if  the  plant  S(nr)  is  minimal  [9].  This 
means  that  the  sufficient  conditions  of  lemma  1  may  be  satisfied  and 
hence,  there  may  exist  reduced  order  controllers  which  are  still 
optimal,  i.e.,  I(nr,nc)  =  0  for  nc < nr.  However,  for  brevity  in  the 
presentation,  it  will  be  assumed  henceforth  that  the  controller 
(1.2)  is  completely  controllable.  (Refer  to  [10]  for  extension  of 
this  work  to  the  general  case).  This  assumption  implies  that  the 
condition  (2.15c)  of  lenma  1  will  not  be  satisfied,  but  (2.15b)  may 
still  be  satisfied. 


V*  t  *Al  J  m  ■**»■<*  •  f  %  yV 


In  the  context  of  model  reduction,  a  set  of  coordinates  called 
Cost  Decoupled  Coordinates  was  presented  in  [2].  These  coordinates 
cause  the  largest  number  of  component  costs  U.(x)  to  be  zero.  Appli¬ 
cation  of  the  CCA  theory  in  these  coordinates  yields  a  reduced  model 

of  order  n  *  rank  [C]  with  the  property  that  \J0  -  V  .  Hence  such  reduced 
r  K 

order  models  are  called  "cost-equivalent  realizations"  [2].  Analogously, 
we  define  Controller  Cost  Decoupled  Coordinates  as  follows. 

Definition  3.  The  Controller  COst-DEcoupled  (CODE)  coordinates  are 
defined  by  the  following  properties : 


(PI)  X  =  I 


(3.5a) 


(P2)  GTRG  =  diag  (y,2,  •••,  y2  ,  0,  •••,  0}  (3.5b) 

1  rl 


2  2  2 
Y-j  >  ^2  1  •••  1  Yr  >0  » 


(3.5c) 


where  r^  =  ranfc[G]  <_  m  . 

These  CODE-coordi nates  are  non  unique  as  established  by  the  follow¬ 
ing  proposition. 

2 

Proposition  1:  Under  the  assumption  of  distinct  y..  ,  i=l,  2,  ..»  in 
(3.5b),  the  CODE-coordinates  are  unique  within  a  similarity  transforma¬ 
tion  x  =  T  x^,  of  the  form 

Pi  °" 

T  = 

0  To 


where 


T-j  =  diag  {+1,  +1,  ....  +1}  eR 


rlxrl 


V  '* ■  - V-'V- V *  V  N  V*  V* * •  '*  V 
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T2eR 


(nr-r1 )x(nr-r1 ) 


ie  any  orthonormal  nutvix. 

Proof:  Since  the  covariance  of  the  transformed  coordinates  is  given 
by 

lim  E{xN(t)xJ(t)}  =  Tim  T'1  E{x(t)xIt)}T'T  , 
t-*-°°  t-*-® 


the  matrix  T  must  be  orthonormal  to  satisfy  (3.5a).  The  state  weight¬ 
ing  matrix  G^RG  in  the  transformed  coordinates  is  tVrGT,  which  should 
satisfy  (3.5b,c),  i.e. 

TTGTRGT  =  diag  {y*.  . yj;  ,  0 . 0}  . 

(3.6a) 


Now  (3.5b,c)  imply  that 


and 


where 


mxr, 

G  =  [Gr  0];  G1  tH  1 

g|rg1  =  A-j , 


A-|  =  diag  {y^ 


2 

y2, 


Hence  partitioning  T  (which  must  be  orthonormal}  as 


T  = 


rewrite  (3.6a)  as 


(3.6b) 

(3.6c) 


/.  S', 


■'\\VJy-, 


Two  of  the  equations  resulting  from  (3.6c)  are 


which  can  be  satisfied  if  and  only  if  =  0,  T^  *  0,  and  is  as 
defined  in  ProDOSition  1.  # 

Hence,  there  exists  a  considerable  flexibility  in  the  cost 
decoupled  coordinates:  from  the  structure  of  the  transformation  in 
proposition  1  and  from  (3.6b),  notice  that  the  CODE-coordi nates  are 
not  uniquely  defined  within  the  null  space  of  G.  This  non  uniqueness 
will  later  be  used  to  obtain  a  generalized  Hessenberg  representation 
while  remaining  within  the  class  of  CODE-coordi nates. 

Now,  in  any  CODE-coordi nates,  the  controller  has  the  following 
property  (in  addition  to  property  (3.5)). 


Proposition  2:  Any  controller  S£(nr)  in  CODE-aoordinates  satisfies 
the  following  property  in  addition  to  (PI)  and  (P2) 


*  1  '  ■ 

Vx>  ■ 

l  0  1>ri  • 


The  proof  follows  immediately  from  the  substitution  of  (3.5a)  and 
(3.5b)  into  (3.4).  * 


Proposition  3:  The  component  costs  associated  with  the  cost  decoupled 
controller  coordinates  are  minimally  sensitive  to  perturbations  in  the 
weighting  matrix  G^RG. 

Proof:  The  proof  relies  on  a  result  derived  by  Skelton  and  Wagie  [12] 
which  is  restated  here. 


Lema  2  [11]  Let  be  an  eigenvalue  of  a  matrix  A.  Then  the  sensiti¬ 
vity  of  A.j  to  perturbations  in  A,  measured  by 


A  3A .  « 

S  .  r  ii  1  ii  2 


liar'll' 


» 


2  ^  m 

where  ||(-)ll  =  Tr[(*)  (•)]»  is  bounded  from  below  by  S^  >_  1  and  S^ 

takes  on  its  minimum  value  S.  *  1  if  and  only  if  the  matrix  A  is 
normal  (i.e.  Aa"'"  =  A^A). 

Now  note  from  proposition  1,  that  the  component  costs  (x)  are 
the  eigenvalues  of  GTRG,  i.e., 

^(x)  =  A.j  (GTRG)  ,  i  =  1,  2,  ....  nr  .  (3.7) 


Since  G^RG  is  syimetric  (hence,  it  is  normal),  the  proof  follows  from 
lemna  2  and  (3.7).  # 

The  implications  of  proposition  3  is  the  following.  Consider 
the  case  when  the  controller  state  weighting  matrix  G^RG  in  the  cost 
function  is  subject  to  perturbation  -  this  may  happen,  for  example, 
when  the  control  weighting  matrix  R  is  changed.  In  this  situation, 
it  is  shown  that  the  choice  of  the  reduced  controller  is  least  sensi¬ 
tive  to  these  perturbations. 

Note  from  Proposition  2,  that  all  the  controller  components 
x.j ,  i  >  r.j  are  candidates  for  truncation,  since  their  corresponding 
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component  costs  are  zero.  Hence,  a  reduced  controller  obtained  by 
truncating  some  of  these  zero  cost  states  seems  attractive.  We  will 
now  evaluate  such  a  reduced  controller,  Sc(nc). 

Theorem  1:  The  error  indices  7 (n^ , nc )  and  I(nr,nc)  associated  with  the 
reduced  controllers  Sc(nc  d.  ri )  obtained  by  truncating  t  =  nr~nc 
CODE-controller  states  from  S^( nr)  satisfy  the  following 

(i)  I(nr.  nc)  =  0  nc>r1  (3.8) 

(ii)  If  (2.6)  is  stable,  then 

nc]  =  V  trt°R  ^WXR^  ’  nc  -  rl  (3.9) 

where  (XR-XR)  satisfies 

(Xr-Xr)Ar  +  AR(XR-£R)  -  U(nc)  *  0  (3. TOa ) 

and  where  AR,  CR  and  Q  are  as  defined  in  (2.4),  (2.9c)  and 
”0  0  0 
U (nc)  ®  0  0  (Art-FrMt) 

-°  ^ART‘FRMT^  0 

(ill)  In  (2.13b),  u(nc)  =  1  Tr[CR(2CR]  is  a  constant  for  nc>r.j 

(3.11) 

Proof:  Since  nc  >  r-j  in  view  of  (3.6b)  and  (2.4d),  (2.12a)  yields 
(3.8).  Now  since  Cy  =  0,  (2.12b)  reduces  to  (3.9)  with  (xR-XR) 
satisfying  (2.12c).  Comparing  (3.10a,b)  with  (2.12c),  we  need  to 
show  that  u  =  u(nc)  as  defined  in  (3.10b).  Recall  from  (2.4b)  that 
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art‘frmt 


(3.12a) 


since  Gy  =  0  and  Ac  =  ART  +  BRGy  -  FRMy.  Also  recall  from  (2.4a)  that 

RT 


=  lim  E{xR( t )  x y ( t ) } 
t-*« 

XR(t)  xj(t) 

=  lim  E{  Xy(t)  x y ( t ) 
t-K*> 

xR(t)  x|(t) 


Now,  using  the  property  that 

K(tn 


X12  =  lim  E(  [xRT(t)  XyT(t)]} 

t-«°  Xy  (  t  ) 


and  (3.5a),  XRy  is  shown  to  be 


*R(t) 

limE{  .  [xl(t)x^(t)]} 

t-«°  x  (t)  K 


*RT  =  lt  • 


(3.12b) 


Substituting  (3.12)  in  (2.1 2d )  to  show  that  U  =  ti(nc)  proves  (ii).  Since 

Cy  *  0  for  nc  _>  r^  the  y(nc)  defined  in  (2.13b)  is  the  same  constant  for 

all  nc  _>  r^  and  hence  ( i i i )  holds.  # 

Now,  since  Cy  =  0,  Xy  lies  in  the  nullspace  of  the  output  matrix  of 

(2.3),  its  observability  is  measured  by  ARy,  i.e.,  the  "smaller"  (in  a 

sense  to  be  made  precise  later)  is  ARy,  the  less  observable  is  Xy.  In 

fact,  if  ARy=  0,  then  Xy  is  unobservable.  Furthermore,  ARy  =  0  implies 

Ac  =  A Ry  -  fRMj  =  0  and  Xy  is  unobservable  in  Sc(nr).  Hence,  by  lerrma  1» 
RT 


.  v  v-  -Vvv*;. 

*.  V  %  •.  A.  A  L  V  -  r  .  r 


v ' ’  V V-*%’ ^ 

l  >  »  -W  m  " 
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I(nr,nc)  =  0.  Therefore,  in  CODE-coordi nates  notice  that  I(n  ,n  ),  is 
influenced  by  the  observability  of  the  'truncated'  controller  states. 

However,  in  general,  A^j  will  not  be  zero.  But  there  may  exist  some 
xi  eXy  which  are  either  unobservable  [in  which  case  a  truncation  of 
them  would  yield  l(nr,nc)=0]  or  "nearly"  unobservable,  [in  which  case 
I(nr,nc)  would  be  small]. 

In  order  to  identify  these  "nearly"  unobservable  states,  we  make 
use  of  the  transformation  in  proposition  1  to  obtain  a  representation 
°fSc(nr)  in  a  convenient  set  of  CODE-coordi  nates  having  the  following 
structure.  [The  computational  details  are  given  in  Appendix  A]. 
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and  where  r.  =  rank  (A.  .  . ),  i  =  2,3,  ...»  p,  and  the  matrices  A.  .  . 

1  ■  - 1 » *  i-l,i 

are  compatibly  dimensioned. 

Representations  of  systems  in  the  form  of  (3.13)  have  been  used 
in  literature  in  different  contexts.  For  example,  Tse  et  al.  [13]  and 
Perkin  et  al .  [14]  have  used  Generalized  Hessenberg  Representations  (GHR) 
in  the  context  of  model  simplifications.  In  [13]  and  [14]  GHR  is  used 
"...  for  detecting  weakly  observable  subsystems".  Also,  if  A..  ^  is 
"small"  in  the  sense  of  some  reasonable  criteria,  a  reduced  model  is 
obtained  by  neglecting  (truncating)  x. ,  j>i.  Clearly,  from  the  pre- 

J 

sence  of  the  trailing  zeros  in  the  matrix  G  in  (3.13),  it  is  seen  that 
if  for  some  i,  A.  • , ,  *  0  (or  "small")  then  x.,  j>i  are  unobservable 

' J 

(or  nearly  unobservable).  The  representation  (3.13)  differs  from  the 
GHR  in  the  sense  that  G^lm  which  is  required  of  a  GHR.  However,  the 
statements  made  above  are  also  applicable  for  the  controller  in  (3.13). 
Another  context  where  such  a  representation  is  used  is  in  extracting 
a  minimal  realization  from  a  non-mi nimal  realization  of  a  given  trans¬ 
fer  function  [12],  where  again  it  is  used  to  detect  the  unobservable 
subspace.  The  algorithm  presented  in  Appendix  A  is  almost  identical 
to  that  used  in  [12].  Due  to  these  similarities  some  of  the  results 
from  [13]  can  be  specialized  to  the  controller  reduction  context.  In 
particular  we  have  the  following. 


Conjecture:  If  for  some  i,  j| A.  .  ,||^  is  much  smaller  than  ||A.  || , 


^  ||X(|  =  a^^(XTX)  =  maximum  singular  value  of  X. 
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s 


A 


m 


SQ(nc):  p  *  Aq  p  +  FpZ  ,  peR 

R 


UR  =  GR  p 


♦  "c  -  L  rj 
c  j-i  J 


where  6R  =  [G-j,  0,  ....  0],  obtained  by  deleting  the  controller  states 
Xj.  j  =  i+1,  ....  P  yields  a  small  controller  error  index  7(nr,nc). 

To  show  this,  note  from  the  structure  of  the  controller  in  (3.13)  and 
from  the  definition  of  A,.  that 


(nc"ri >*ri+i 

Ar  •Art-FrMt-  -"a"1-11 
Crt  Ri  k  i  Ai>i+1 


ncx(t-ri+l ) 


Hence,  from  the  definition  of  U(nc)  in  Theorem  1,  small  ||A^  j+^ll  implies 
small  ||u(nc)jj.  Furthermore,  if  AR  is  stable,  the  solution  tc  (3.10a)  is 


A  f  ^pt 

(Xr-xr)  •  J  e  R  U(nc)  e  K  dt  , 


from  which  we  get 


XR-X„l|fj  II  II  U(nc>  II  -  II  e/ 


Hence,  from  (3.10b)  small  ||A^  ^  j|  leads  to  small  || U (nc ) ] |  and  hence  from 
(3.3)  to  small  I(nr,nc). 

Recall  that  all  the  controller  states  x^ ,  i  >  r-j  have  zero  controller 
costs  u  (x)  and  that  the  controller  states  ,  i=l,2,...,r^  are  ordered 


according  to  (3.5b).  This  means,  since  y.  =  V.(x),  that  x.  is 


1  i 


significant  than  xi+1.  i  =  1,  2,  ....  r-j .  Hence,  if  the  order  of  the 
controller  is  predetermined  to  be  nc  <  r^ ,  then  the  controller  states 

xi  ’  i="c+1 . nr  are  to  be  truncated.  Therefore,  for  nc  <_  r^ ,  there 

is  no  ambiguity  of  the  controller  states  to  be  truncated,  and  the  addi¬ 
tional  complexities  of  the  GHR  type  structure  (3.13)  need  not  be  computed 
It  is  only  when  nc >  r^  that  one  requires  an  additional  criteria,  such  as 
observability  of  the  components  x..,  i  >  r-j ,  to  determine  which  of  these 
xi  are  to  be  truncated.  It  is  this  requirement  that  led  to  the  develop¬ 
ment  of  (3.13).  Clearly,  more  investigation  is  required  in  the  study 
of  the  properties  of  the  controller  (3.13). 


Stability  Properties 

Of  primary  concern  to  any  controller  reduction  scheme  is  the  stabil¬ 
ity  properties  of  the  closed  loop  system.  The  following  theorem  and 

corollaries  present  the  known  instability  properties  of  S  (n  ).  It 

c  c 

is  assumed  that  the  controller  is  represented  as  in  (3.13). 

Theorem  2:  Let  -Sc(nc)  be  a  reduced  controller  obtained  by  truncation 
from  Sc(nr).  Then  the  closed  loop  system  (2.6)  is  asymptotically 
stable  only  if  Sc(nc)  is  controllable,  i.e.,  only  if  the  matrix  pair 
{(Ap+BRGR),  Fp}  is  controllable. 

Proof:  This  is  proved  by  showing  that  if  Sc(nc)  is  not  controllable  i.e. 
if  thr  matrix  pair  {(AR+BpGp-FRMp) ,  FR}  is  not  controllable,  then  AR 
in  (2.6)  is  not  asymptotically  stable.  (Note  that  uncontrollability 
of  {(Ap+BpGp),  Fr}  is  equivalent  to  uncontrollability  of  { (AR+BpGp-FpMR) , 
Fr)  [15 J). 
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Now,  let  (n£ )  be  uncontrollable  and  for  simplicity  assume  that 
the  matrix  (AR+BRGR)  has  n£  distinct  eigenvalues.  Then  there  exists  a 
left  eigenvector  of  (AR+BRGR)  such  that 


°i*FR  =  0  *  VC 


(3.14a) 


and 


°i*  ^AR+BRGR^  *  Xi  °i 


(3.14b) 


for  some  ie{l,2,...,  nc>,  where  A,.  eA  (Ar+BrGr)  and  *  denotes  the  conju¬ 
gate  transposition. 

Now,  since  the  controller  states  x  are  covariance-normalized,  i.e., 

X  =  lim  E{x(t)  xT(t)}  =  I  , 
t-»  nr 

(3.3d)  reduces  to 

(A+BG)T  +  (A+BG)  +  FVFT  =  0  (3-15) 

which  can  be  written  in  its  partitioned  form  as 


(AR+BRGR)T  (ATR+BTGR^T 

+ 

(ar+brgr^art+brV 

+ 

FRVFRT 

frvftT 

_(art+brgt)T 

[atr+btgr^at+btgt^_ 

[tvfrT 

ftvfJ_ 

=  0 


The  upper  left  corner  of  this  equation  yields 

<V¥r)t  +  <Wr>  +  frvfrt  -  o  • 


(3.16) 


Now,  pre-  and  post-multiply  (3.16)  by  a.,  and  ai  respectively  to  get 

ai  (AR+BRGR)T  ai  +  ai*(AR+BRGR)  ai  +  ai*  FRVFRT°i  =  0  '  (3J7a) 


Kpvwvwm  ivm 
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In  view  of  (3.14)  and  the  conjugate-transpose  of  (3.14b)  this  reduces  to 

Xj  a,.  a..  +  A.j  =  2  Re ( ) - 1| ^  =  0  .  (3.17b) 

where  Re{*)  denotes  the  real  part  of  (•)•  Clearly  from  (3.17b) 


Re(x.)  *  0  . 


Now  consider  AR  where 


(3.18) 


*  A  *  r 

ni  =  [0,  0,  a.  ]  ;  ni  tC 


Then 


ni  ar  =  C°*  °*  °i  ^ 


*R 

^R 

FRMR 


mRT 

at 

frmt 


brgr 

Vr 

Ar 


=  [a.  FpMp,  ai  FrH,.  ,  a,-  ACr] 

=  [o,  0,  Aj.  ] 

K 

where  (2.4b)  and  (3.14a)  have  been  used.  Now  since  Ac  =  ar+brgr  " 

R 

FrMr,  we  get 

H-j  ar  =  Cg»  0,  0i  (Ar+BrGr  -  frmr)] 

=  [0,  0,  a.*(AR+BRGR)]  (3.19a) 

=  [0,  0,  A.,  a,.  ]  =  A^n^ 

where  (3.14)  has  been  used.  Hence  from  (3.19a)  we  see  that 

A.  e  A(AR)  .  (3.19b) 
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Therefore  from  (3.18)  and  (3.19b)  is  not  asymptotically  stable.  # 

Corollary  1:  If  the  zeroth  order  Markov  Parameter 1  of  Sc(nr)  is  zero , 
then  the  resulting  closed  loop  system  (2.5)  for  any  5£(nc  <  r^), 
is  not  asymptotically  stable. 

Proof:  Since  the  Markov  Parameters  are  invariant  under  similarity 

transformations,  the  zeroth  order  Markov  Parameter  of  S Jn„)  is  zero  if 

C  r' 

and  only  if 

*  G1F1  =  0  ,  (3.20) 

where  the  matrix  F  of  Sc(nr)  is  partitioned  as 

F  = 

Now,  since  rank  (G-|)  =  r  <_  m,  (3.20)  is  satisfied  if  and  only  if  F-,  *  0. 
Therefore  from  (2.2)  and  (2.5),  FD  =  0  for  all  n„  <  r.  and  hence  S  (nr  <  r. ) 
is  not  controllable.  The  proof  now  follows  from  Theorem  2.  # 

The  final  controller  reduction  procedure  is  summarized  in  algorith¬ 
mic  form  in  Appendix  A. 

We  now  demonstrate  this  controller  design  scheme  with  the  aid  of  an 
example  in  the  next  section. 


F.  e  R 


(nr-r] )xz 


GF  =  [G-j ,  0] 


JThe  i-th  order  Markov  Parameter  J.  of  S(n  1  is  defined  as 

^  c  r 

Ji  =  GtA+BG-FM)1 F  ;  i  =  0,  1,  2,  ...  . 


.  \  ••  -  ‘.--v ■ 


«*, 


I  .V  . 


IV.  EXAMPLE 


The  example  considered  is  a  Solar  Telescope  (SOT),  schematically 
represented  by  Fig.  1. 


Fig.  1  Solar  Optical  Telescope  Structure 
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The  truss  structure  of  Fig.  1.  has  been  modeled  by  finite  element  methods 
in  [11].  This  SOT  was  originally  described  by  44  modes,  but  the  model 
was  reduced  to  10  modes  by  a  Modal  Cost  Analysis  [16].  (This  detail  is 
not  given  here).  The  reduced  model  of  10  modes  (8  elastic-  and  2  rigid- 
modes)  is  described  by 


+  2c.jU3.jri.j  +  =  b..  (u  +  w),  i=l,...,10 


(4.1) 


There  are  m  (=8)  force  actuators  whose  control  forces  u-,,...,ug  act  in 
the  z-direction,  located  as  shown  in  Fig.  1.  The  actuator  noise  (white) 
is  denoted  by  w  and  has  intensity  W  =  10  ^Ig.  u>..  is  the  frequency  of  the 
i-th  mode  and  is  the  damping  ratio  of  the  i-th  mode.  The  frequencies 
(ordered  by  modal  cost)  are  given  in  Table  1,  and  the  damping  ratios  are 
taken  as  «  0.001,  i  =1,  2,  ....  8  and  =  0,  i  =  9,  10  corresponding 
to  rigid-modes.  The  control  objective  is 


*  '  1-  E<  li*  II  Q  *  II  “Hr)  . 


Table  1 

Modal  Frequencies 


(4.2) 


2 

3 

0.914 

10.817 

Mode  # 


(rad/sec) 114.853  0.914  10.817  3.652  153.43  53.861  3.630  149.37  0  0 


(*:  #9  and  10  are  the  rigid  modes) 


5 

6 

153.43 

53.861 
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and  where  LOS  is  the  angular  displacement  of  the  optical  Line  of  Sight 


(LOS)  error  about  the  x  axis.  L0Sy  is  the  optical  Line  of  Sight  error 


about  the  y  axis.  Defocus  is  caused  by  changes  in  the  length  of  the 
optical  axis  (deflections  in  the  z  direction).  The  output  y  is  related 


to  the  modes  ^  by 


10 

y  ■  ,L  pi"i 


(4.3) 


The  output  weighting  matrix  is  chosen  as  Q  =  diag  {1,10,10  }  to  indicate 


that  line-of-sight  errors  about  the  y  axis  are  most  critical  to  the 


experiments.  The  available  measurements  for  the  control  law  implementa¬ 


tion  are 


Z  =  y+v  ,  ZeR  ,  £  =  3 


(4.4) 


where  the  noise  v  is  assumed  to  be  a  zero  mean  white  noise  with  intensity 


V  *  10”  1 3  to  reflect  the  uncertainties  in  the  measurements. 


This  second  order  representation  (4.1-4)  of  the  SOT  is  equivalently 


written  in  a  first  order  state  form  as  follows: 


x  =  Ax  +  Bu  +  Dw  ,  xeR 


y  =  Cx 


(4.5) 


z  =  Mx  +  v 


<  -  *  *  ".  V  \  “  »  ^  "  ■<_  "f  *  ^  *  \  a.  \  "V.  ».  ^  <  “  .  \  ^  V  vl 


where 


6  r  *  • 

“  LTl'|»ri2,***»T,,|0,r'l’ri2l 


!10 


]T  ,  nr=20 


1 

o 

*— « 

°J 

"o" 

II 

c 

2 

,  B  =D  = 

-ui  -  2  Cw_ 

J5_ 

C  =  M  =  [P  0] 

and  where 

u  -  d i aQ C tiJ-j  ,^2  ^  *  C  ~  0.001 


and  P  =  [P] »P2’  *  ’^10^  • 


The  matrices  8  and  P  are  given  in  Table  2.  Having  represented  the  SOT 
in  the  form  (4.5)  required  by  the  Cost  Decoupled  Controller  Design 
Algorithm  (given  in  Appendix  A),  this  algorithm  was  used  to  design 
reduced  order  controllers. 

Reduced  Order  Controller  Design: 

The  control  weighting  R  in  (4.2)  was  taken  as  R  *  plg  and  p  was 
varied  to  study  controllers  of  different  bandwidth.  For  each  p,  the  Cost 
Decoupled  Controller  Design  Algorithm  was  applied.  This  constitutes  the 
following  steps. 


Table 
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i)  Construct  an  optimal  controller  for  (4.5). 

ii)  Transform  the  controller  to  the  form  in  (3.13)  v.ith  properties  (3.5). 

iii)  Obtain  reduced  controllers  of  order  nc  by  truncating  the  last 

(np-nc)  controller  states  and  evaluate  these  controllers.  (Note, 
nc  =  nr  *  20  is  the  full  order  optimal  controller  for  this 
example). 


The  controller  design  algorithm  was  repeated  for  a  range  ofp  =  .01-100, 
and  reduced  controllers  of  different  orders  were  obtained.  The  per¬ 
formance  of  the  controllers  was  evaluated  and  is  presented  in  Fig,  2. 

The  different  labels  presented  in  the  following  figures  are  defined  by 


CONTROL  COST  =  lim  E|| u(t)  || 2 
t— 

REGULATION  COST  =  lim  E||y(t)||f: 

t—w  V 

CONTROL  EFFORT  =  [control  cost]172 

LOS(y)  =  lim  E || LOS  (t)||  . 
t—  y 

The  dotted  lines  indicate  lines  of  constant  p.  The  solid  lines  are 
continued  until  instability  occurs.  Fig.  2  illustrates  that  the 
Regulation  Cost  asymptotically  reaches  a  constant  value  ('vTOE-07) 
corresponding  to  a  control  cost  of  greater  than  1-6E-06N2  (control 
effort  >  1.3E-03  Newtons).  Fig.  2  and  Table  3  show  that  for  a  fixed 
level  of  control  effort  (8-5E-4  Newtons  for  Table  3),  controllers  of 
smaller  order  perform  worse  than  the  controllers  of  larger  order. 


Table  3  shows  that  the  4-th  order  controller  and  the  full  order 


(nc=20)  controllers  yield  essentially  the  same  RMS  performance.  Our 
explanation  for  this  is  that  the  relative  observability  conditions  in 
the  conjecture  on  p.  21  regarding  small  7(nr,nc)  happens  to  hold  more 
accurately  for  the  4-th  order  controller  than  for  other  reduced 
controllers.  However,  the  4-th  order  controller  has  a  lower  margin  of 
stability  which  is  indicated  by  this  controller  becoming  unstable  even 
for  a  small  increase  in  the  control  effort  beyond  CONTROL  COST  >  .07xlO"^N 
(See  Fig.  2). 


Table  3 

LOS (y )  vs.  nc  (Control  Effort  =  8.5E-4N) 


n 

c 

20 

18 

12 

10 

6 

H 

LOS(y).  Rad(xl0"3) 

0-28 

0-28 

0-29 

0-35 

0-37 

0-29 

I(nr,nc) 

0 

.0150 

.0196 

.0848 

.1301 

.0245 

Controllers  of  order  nc  =18  and  12  have  not  been  included  in  Fig.  2. 

From  these  figures  one  could  pick  a  design  to  meet  the  mission  objective 
For  example,  if  the  following  is  the  mission  objective: 

(i)  LOS(x)  S  5-0E-4  Rad. 

(ii)  LOS(y)  5  3-0E-4  Rad. 

and 

(iii)  Control  Effort  <  1-0E-3  Newtons, 
one  would  pick  a  6-th  order  controller  instead  of  nc >  6  so  that 
the  performance  specifications  are  met  with  the  least  amount  of  on-line 
controller  hardware/software. 
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Obviously,  none  of  the  reduced  controllers  (nc  <  20)  can  perform 
better  than  the  optimal  controller  (nc  =  20).  One  could  only  aim  to 
get  the  corresponding  performance  curves  close  to  the  optimal  curve. 
Note  that  the  reduced  order  Cost-Decoupled-controller  designs  in  Fig.  2 
are  close  to  the  optimal  curve  until  instability  occurs.  (The 
4th  order  controller  (NC=4)  eventually  went  unstable  in  Fig.  2). 


Comparison  with  Balanced  [5,6]  Controller  Design: 

Since  [5,6]  also  consider  LQG-based  design  problems  we  will  now 
compare  the  reduced  controllers  obtained  by  Cost-Decoupled  (CODE) 
controller  design  algorithm  presented  herein  with  the  LQG-balanced 
controller  design  method  proposed  by  Verriest  [5,6]. 

The  results  are  presented  in  Fig.  3.  Let  CODE  denote  the  con¬ 
troller  design  presented  herein  and  BAL  denote  that  of  [5,6].  The 
points  labeled  by  similar  numbers  indicate  that  the  controllers  were 
oltained  from  the  same  optimal  controller  (nc  =  20)  but  by  different 
methods.  In  this  example  the  CODE-control lers  performed  better  than 
BAL-control lers  and  this  difference  increases  with  smaller  order 
controllers.  For  the  values  of  p  tried  the  LQG-balanced  method  did 
not  yield  any  stable  controllers  of  orders  less  than  10  even  for  low 
control  effort.  The  Cost  Decoupled  Controller  Design  Algorithm  result¬ 
ed  in  stable  reduced  controllers  of  order  less  than  10  as  shown  by 
Fig.  2.  For  a  controller  order  n£  =  10,  the  smallest  line-of-sight 
(LOSy )  error  achieved  by  the  LQG-BALANCED  design  was  0.4xl0~3  rad., 
and  the  smallest  achieved  by  the  Cost-Decoupled  design  was  0.03x10 
rad. 


*  j  f  .  .  .  •  ^  rf*  v-  r  «  ■  *  .■  >  "  *  ***** 

g-'i  v:  .  '■*  -W.' Vi  a  .--a'  ■l’li  iflii  iV>  ifl 
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VI.  CONCLUSIONS 

The  concepts  of  Component  Cost  Analysis  (CCA)  utilizes  the  contri¬ 
bution  of  states  to  a  quadratic  cost  function  as  a  metric  to  measure 
the  significance  of  the  states.  Using  these  concepts,  a  controller- 
reduction  algorithm  is  proposed.  Controller  error  indices  are  defined 
to  measure  the  'quality'  of  reduced-controllers  and  expressions  are 
derived  for  their  computations.  Upper  bounds  are  also  obtained  for 
these  indices.  The  dependence  of  these  indices  on  the  observability 
of  the  controller  states  that  are  truncated  is  also  shown. 

Employing  CCA  to  reduce  these  error  indices,  a  set  of  Cost  Decoupled 
Controller  Coordinates  are  developed.  The  representation  of  the  optimal 
controller  in  these  coordinates  closely  resembles  the  Generalized 
Hessenberg  Representation.  The  truncated  controller  states  in  this 
representation  have  the  following  properties: 

(i)  smallest  controller  component  costs, 

(ii)  the  component  cost  is  least  sensitive  to  the  control 
weighting  matrix  R, 

(iii)  least  observability  in  the  controller, 

( i v )  uncorrelation  from  the  retained  controller  states,  and 

(v)  they  have  the  least  dynamic  interaction  with  the  retained 
controller  states. 

Necessary  conditions  have  been  derived  for  the  stability  of  the 
closed  loop  system  when  driven  by  the  reduced  order  controllers.  The 
conditions  are  shown  to  be  related  to  the  Markov  Parameters  of  the 
full  order  optimal  controller.  A  Solar  Optical  Telescope  is  used  to 
illustrate  the  design  procedure.  The  resulting  controllers  are  compared 
with  those  obtained  by  the  LQG-balanced  method  proposed  by  Verriest  f 5 , 6] . 
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APPENDIX  A.  The  Cost-Decoupled  (CODE)  Controller  Design  Algorithm 
Given  a  model. 


=  A  x 

+  B  u  +  D  w;  w  'v  n(o,W) 

=  C  x 

,  xcR  r,ueRl\weRd,yeRk,ZeR£} 

(A.  1 ) 

*  M  x 

+  v  ;  v  ^  n(o,V) 

the  objective  of  this  algorithm  is  to  construct  a  controller  in  the 
form  (3.13)  with  the  properties  (3.5),  to  reduce 


i 

i 


(A. 3f) 


p  AT  +  A  P  -  P  MT  V']M  P  +  D  WDT  =0 
Ie.  Compute  X  by  solving 

X(A  +  B  G  )T  +  (A  +  B  G  )X  +  FVFT  =  0  (A. 4) 

-I 

ld.  Compute  the  square  root  of  X  t 

*■  ex  exT  <A-5> 

le.  Compute  ©u,  the  orthonormal  modal  matri)?  of 


ex^G^RGex  such  that 


T.T 


iu  ex  G  RG©X©U  =  diag{l/1(i),l/2(i),...,l/ri(x),0...0}  (A.6) 


where  l/,(x)  t  Mx)  1  •  *  *  l  V  (x)  >  0. 
i  c  rn 


II.  I la.  Define  =  ©x  ©u 


(4.7) 


lib.  Define 


A11  A1 2 


A  A* 

21  22 


■1 


AneR 


rixri 


T1  ‘Ac  T-|  J  (  (nr-r1  JxCnj.-^ )  (A. 8a) 
A2  R 


-1  rlxi 

=  T-j  F  ,  F^R  1 


(A. 8b) 


1.  In  case  of  uncontrollable  controllers,  singular  value  decomposition 
may  be  used  in  (A. 5),  to  help  factor  out  the  uncontrollable  subspace. 

2.  For  this  task  use  singular  value  decomposition  til]  or  use  an 
eigenvalue/eigenvector  program  specialized  for  symmetric  matrices. 


Ilia. 


*♦ 

•i 

•i 

A 

.1, 


Obtain  singular  value  decomposition  of  A.  .  .a 

1-1,1 


A,  •,  =  U  L  J  „  i 


i-1  .i 


where  a  =  diag{o1  ,o2  .....or  ;  Oj  i  >  ( 


i  vrxri 

and  V,  eR  r 


Define  T,  = 


c.  Set  r  =  r  +  r. ;  if  r  =  nf  go  to  IV 


Define  Vitl  ‘ 


A  a  v  a  V 

Hi+1 ,i+l  v2  Ai ,i v2 


e.  Set  i  =  i+1  and  p  =  p+1  and  repeat  III 


Define  T  =  n  T. 


*,  s 
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V.  The  reduced  controller  of  order  r  is 

p  =  Ar  p  +  FDz  ,  dert 


UR  -  GR  0 


where 


\ ■  w* 

fr  *  V  • 

GR‘STR  • 


and  where  TR  and  LR  are  obtained  from 


_  -  n»xr 

[T„  Tt]  =  T  ,  Tr  eR  r 


K  4-i  rxn 

*  L  »  T  ,  LR  er  r 

h  R 


(A. 13a) 


(A. 13b) 
(A. 13c) 
(A. 13d) 


(A.13e) 


(A. 1 3f ) 


SRHK 


m 
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APPENDIX  B.  Proof  of  Lenina  1 
Rewrite  (2.8a)  as 

V  *  TrEC^C^]  +  Trtcjac^]  +  (B.l) 

where  XR,  XRT  and  Xy  satisfy 

Vr  +  Vr  +  %ART  +  ART*RT  +  ^R^R  =  0  ^B'2a) 

'VtR  +  %AT  +  AR%  +  ART*T  +  ^R^T  =  0  (B .  2b) 

Vt  +  Vt  +  ^r/tR  +  ATR*RT  +  V1WJ  =  0  *  ^B-2^ 

The  equations  (B.2)  are  obtained  by  partitioning  equation  (2.9a). 

Equation  (2.12a)  follows  from  the  substitution  of  (B.l)  and  (2.10b) 
in  (2.10a),  and  (2.12b)  follows  from  (B.l),  (2.11)  and  (2.8b).  Equation 
(2.12c)  is  obtained  by  subtracting  (B.2a)  from  (2.8b). 

To  prove  (2.13a),  use  (2.8b)  and  (2.9b)  to  rewrite  (2.11)  as 

l(VnC)  =  V  TrCCR^CRxR3  -1  •  (B-3) 

Now  use  the  following  identity  [18] 

*m(X)Tr[Y]  <  Tr[YX]  <  XM(X)Tr[Y]  ,  for  X  >  0,  Y  >  0 

in  (B.3)  to  get 

V  Am^XR^Tr^CR^CR^"1  —  ^nr’nc^  —v  xM^XR^Tr^CR^CR^_1  ’ 
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Clearly,  from  the  definition  of  CR  in  (2.4d),  y  as  defined  in  lerrnia  1 

depends  on  n£.  (2.13a)  is  proved  by  observing  that  I(nr,  nc)  _>  0. 

(2.14a)  is  proved  by  equating  (2.12a)  and  (2.12b).  To  prove  the 

sufficient  condition  (2.14b)  assume  without  loss  of  generality  that 

(2.2)  is  in  observable  canonical  form,  so  that  Xy  is  unobservable 

(i.e.,  Aj.  =  0,  Gy  =  0).  Hence,  from  the  definition  of  ARy  in  (2.4b), 
RT 

art  *  Now,  since  (2.2)  is  the  optimal  controller,  (2.3)  is 
asymptotically  stable.  Therefore,  with  ARy  =  0, 


A(A)  =  A(Ar)  U  a(At)  c  C' 


(B.4a) 


where  a(-)  denotes  the  eigenvalues  of  (•)  and  C  represents  the  open 
left  half  complex  plane.  Hence 

a(Ar)  CC"  .  (B.4b) 

Stability  of  AR  guarantees  that  the  solution  to  (2.12c)  is  XR-XR  -0, 

since  ART  =  0.  Hence  recognizing  that  Cy  =  0  if  Gy  =  0,  (2.14a)  is 

satisfied.  To  prove  (2.14c)  let  (2.2)  be  in  controllable  canonical 

form  so  that  xT  is  uncontrollable  (i.e.,  Ar  =  0,  FT  =  0).  From 

iR 

(2.4b)  Ay R  =  0,  hence  (B.4)  holds.  Also  note  that  from  (2.4c)  Vj  =  0, 
yielding  Xy  =  0  and  XRy  =  0  as  the  solutions  to  (B. 2b)  and  (B.2c) 
since  A(Ay)C.c".  Furthermore,  the  solution  to  (2.12c)  is  XR-XR  =  0, 
since  xRT  =  0.  Since  xRT  =  0  and  XR-XR  =  0,  (2.14a)  is  satisfied. 

This  proves  (2.14c)  and  (2.1 4d) . 

The  proof  of  the  sufficient  conditions  (2.15b)  and  (2.15c)  follows 
identical  steps,  and  equation  (2.15a)  is  obvious  from  (2.12b).  # 
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Abstract 


Algorithms  are  given  to  determine  the  critical  inputs  and  outputs 
in  a  linear  system  evaluated  by  quadratic  performance  criteria.  This 
type  of  analysis  is  referred  to  as  "input  cost  analysis"  (ICA)  and 
"output  cost  analysis"  (OCA).  The  fundamental  concept  is  to  decompose 
the  quadratic  performance  metric  into  contributions  from  each  input/ 
output.  This  type  of  "cost-decomposition"  has  application  in  the  selec¬ 
tion  of  best  sensors  and  actuators  in  engineering  control  systems,  and 
that  application  is  the  focus  of  this  paper. 


1.0  INTRODUCTION 

The  problem  under  consideration  is  the  time-invariant  linear  dynamic 
system  described  by 

(1.1a)  x  =  Ax  +  B(u  +  w) 

(1.1b)  y  =  Cx 

(1.1c)  z  =  Mx  +  v 

and  evaluated  by  the  performance  metric 

0.2)  V  =  lim  E(||y(t)||g  +  [|u(t)|||)  .  ||y|ln  *  yTPy  . 

Where  the  zero-mean  white  noise  disturbances  are  described  by  Ew(t)  =  0, 
Ev(t)  =  0,  Ew(t)wT(x)  =  Wfi(t-r),  Ex(o)  wT(t)  «  0,  Ex(o)vT(t)  =  0, 

Ev(t)vT(x)  =  Vfi(t-T),  W  >  0,  V  >  0.  The  notation  "Q  >  0"  means  that 
"Q  is  a  positive  definite  matrix".  The  superscript  T  denotes  matrix 
transposition,  and  the  notation  yeR*,  UeR^zeR^,  xeRn,  WeRm  indicates 
the  dimensions  k,  m,  E,  n,  m  of  the  real  vectors  y,  u,  z,  x,  w  respec¬ 
tively.  It  is  assumed  that  rank  B  =  m,  rank  M  =  l,  and  rank  C  =  k. 

This  eliminates  the  possibility  of  redundant  inputs/outputs.  Now  suppose 
that  m  >  1.  (In  fact  in  some  large  scale  systems  m  is  quite  large  [7].) 

One  might  wish  to  know  which  of  the  inputs  have  a  greater  effect  on  the 
responses  that  contribute  to  the  performance  metric  V  in  both  the  open 
loop  (u=0)  and  closed  loop  (u^O)  cases.  We  shall  refer  to  these  inputs 
as  the  more  "critical"  inputs.  Generally  speaking,  there  are  three  reasons 
for  using  input  cost  analysis: 

(i)  For  the  open  loop  case,  the  "critical"  inputs  can  be  retained  in  the 
model  and  the  others  deleted  if  a  simplified  representation  of  the 
system  is  desired,  using  fewer  inputs. 
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(ii)  If  the  inputs  evolve  from  actuator  devices  of  varying  degrees  of 
reliability  (i.e.  closed-loop  case),  the  "critical  actuators  suggest 
which  actuator  devices  should  be  made  more  reliable. 

(iii)  If  the  inputs  evolve  from  actuator  devices  with  m  admissible 
locations  throughout  the  engineering  system,  only  the  m  <  m  most 
"critical"  actuators  might  be  retained  if  the  system  is  to  be 
designed  using  only  m  <  m  actuators.  In  this  way  the  optimal 
location  of  m  actuators  is  sought  from  an  admissible  set  of 

m  >  m  actuators.  In  fact,  when  noisy  actuators  are  considered,  the 
set  of  m  actuators  may  yield  better  performance  than  the  total  set 
of  m  actuators. 

A  similar  set  of  circumstances  and  questions  exist  concerning  outputs 
of  the  system. 

The  idea  of  decomposing  the  performance  metric  V  in  terms  of  contri¬ 
butions  from  each  input  was  presented  in  [1]-  [3]  and  this  analysis  is 
referred  to  as  "Input  Cost  Analysis"  ( I C A ) .  Similarly,  "Output  Cost 
Analysis"  (OCA)  was  also  presented  in  [l]-[3].  This  paper  develops  ICA 
and  OCA  for  the  more  difficult  case  of  closed-loop  systems,  whereas 
[2]  and  [3]  are  limited  to  open-loop  systems.  In  [1]  and  [2],  the 
definition  of  the  itb  "input  cost"  lAb  is 


(1.3a) 


b  A  1  31/ 


v  r  k 

3b.  i 


B  =  tbl . bm^ 


and  the  definition  of  the  ith  "output  cost"  IAC  is 


(1.3b) 


ii  C  ^  1  31/  _  rT  _  r  i 

^i  7  3c.  ci  ^  tcl*  ..•»  C|<J 


The  definition  for  input  costs  employed  herein  and  in  [3]  is 


=  ||y||g 


(1.4a) 


and  the  definition  for  output  costs  described  herein  and  in  [3]  is 


(1.4b)  V*  =  11m  l  E y, 


.  V(t)  A  yTQy 


To  see  clearly  the  distinctions  between  (1.3a)  and  (1.4a)  note  that  the 
inputs  and  outputs  of  (1.1a)  can  be  expanded  in  terms  of  the  columns 
of  B,  and  the  rows  of  C  i.e. 

m 


(1.5)  Bw  =  l  b..w..  and 


(1.6)  y.  =  (Cx)i  =  ciT  x(t)  . 

Note  that  definitions  (1.3)  are  given  in  terms  of  parameters  b^  and  c^ , 
whereas  the  definitions  (1.4)  are  given  in  terms  of  the  variables  w^  and 
y-.  The  early  work  on  measurement  optimization  [8],  [1]  used  the 
parameters  c.  in  the  optimization  process.  However,  the  parameters  b^ 
and  c^  are  only  the  coefficients  of  the  variables  for  which  we  wish  to 
determine  a  performance  value.  That  is,  the  physical  entity  which 
might  be  deleted  generates  w^ (t)  or  y. (t).  Hence,  it  is  reasonable  to 
expect  definitions  (1.4)  to  be  more  accurate  in  predicting  the  effect 
of  the  deletion  of  an  input  or  output.  However,  there  is  an  important 
case  when  definitions  (1.3)  and  (1.4)  yield  the  same  result. 

Theorem  1 

Consider  the  linear  system  (1.1),  (1.2).  If  u(t)  =  0,  then 
l/fb  =  V.W  and  Vf  =  P.y  . 

Proof: 

To  show  that  V^c  -  V ^  ,  compute  first 


(1.7b) 


lim 

t-*» 


E{  T  y.Q..y.' 
1  ,yJwiJJV 


(1.7c) 


*  lim  E[yy  Q]. 


t-*» 


ii 


Then  note  from  (1.3b)  that  . 

(1.8a)  V.c  =  7  |r~  ci  =  1 1m  \  (9ac  ^ei  =  (  Z  EyiQijyj)c 

1  2  8ci  1  t—  1  3ci  t+o  1  3ci  i  ,j=l  1  10  J 

^  t  t 

=  limj-|—  (  l  E(c.Tx)Q.  .(c.Tx))  c. 
t—  1  3ci  i  ,j=l  1  1J  J  1 

k  j  x  k 

=  lim  E[  E  x  c.Q..x  ]c.  =  lim  E{  i  y.Q..y.) 
t~  j=l  J  1  t+»  j=l  J  1J  1 


(1.8b) 


=  lim  Elyy7 QL,  -  lim  E[cxxTcTQ]. .  =  [CXCTQ]. 

t-X»  1 1  t-»~ 


n 


where  X  is  the  steady  state  covariance  satisfying 

(1.9)  0  =  XAT  +  AX  +  BWBT 

Hence,  for  open  loop  systems  (u(t)  =  0),  (1.7c)  and  (1.8b)  prove  that 
V.jC  =  l^y  .  The  proof  that  follows  similar  arguments,  using 

the  fact  that 

(1.10)  V  =  tr  [SBWBT] 
where  S  satisfies 

(1.11)  0  =  SA  +  ATS  +  CTQC  , 


and  leads  to  the  calculation 
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(1.12)  l/.W  =  l/.b  =  [BTSBW]..  .  # 


Certainly  the  new  definitions  (1.4a)  and  (1.4b)  offer  no  advantage 


in  open-loop  situations,  as  evidenced  by  Theorem  1.  However,  in  closed- 


loop  situations  the  optimal  closed-loop  plant  matrix  A,  which  has  the  form. 


a  [A  BG  "1 

(1.13a)  A  =  (_FM  A+BG-FmJ  ' 


F  =  PmV1 


0  =  PA'  +  AP  - 


pmV1  m>+bwbt 


G  =  -R'Vk,  0  =  KA  +  ATK 


kbr-1btk  +  CTQ 


(1.13b) 


x  =  Ax  +  Bhj  ,  xT  =A  (xT,  xT)  ,  =  (wT,  vT) 

y  =  Cx  ,  t/T  =  (yT,  uT) 


(1.13c) 


t  rw  °i  T 

E«j(t)w  (t)  *  WS(t-i)  =1  I  6(t-x),  Ex(o)w  (t)  =  0. 

Lo  vj 


=*C  “]  •  «-C  3 


is  a  function  of  the  input/output  parameters  b.. ,  c...  This  situation  does 
not  occur  in  the  open-loop  system  since  A  in  (1.1a)  is  not  a  function  of 


the  parameters  of  B  or  C.  Thus  the  definitions  (1.3)  and  (1.4)  will  yield 
different  results  for  closed-loop  applications.  The  closed-loop  results 
which  follow  from  (1.4)  are  developed  in  this  paper. 


The  properties  of  input  costs  (1.4a)  and  output  costs  (1.4b)  for 


open-loop  systems  are  discussed  in  sections  2  and  3  respectively.  The 


essential  results  of  these  two  sections  are  combined  in  section  4  to 


produce  the  closed-loop  ICA/OCA.  Closed-loop  ICA/OCA  is  then  used  in 


the  selection  of  noisy  actuators  (section  5)  and  the  selection  of  noisy 
sensors  (section  6),  while  the  problem  of  simultaneous  selection  of 


noisy  actuators  and  sensors  is  addressed  in  section  7. 


m 


m 


In  Section  8,  the  methods  suggested  in  this  paper  are  applied  to  a 
substantial  example  of  a  large  9pace  structure.  Concluding  comments 
appear  in  Section  9. 


2.0  Properties  of  Input  Costs  (1.4a)  for  Open-Loop  Systems  (u(t)  =  0) 
Before  applying  results  (1.4)  to  the  closed  loop  system  it  will  prove 
useful  to  know  the  properties  of  the  input  cost  l/.w  for  the  open-loop 
system  (u(t)  =  0).  This  Section  is  devoted  exclusively  to  the  case 
u(t)  =  0  in  (1.1),  (1.2)  with  A  asymptotically  stable.  Proofs  of  all 
remaining  theorems  appear  in  Appendix  A. 

Theorem  2 

The  open-loop  input  costs  1/^,1  =  1 . m,  defined  by  (1.4a)  and 

calculated  by  (1.12),  satisfy  the  cost-decomposition  property 


(2.1) 


■  ill 


(u(t)  i  0)  . 


Where  the  total  value  of  the  system  performance  metric  is  V,  and  the 

w 

in  situ  contribution  from  W^  is  V.  . 

The  sign  of  U^w  is  nonnegative  under  these  conditions. 

Theorem  3 

For  the  stable  open-loop  system  (1.1),  1.2),  u(t)  =  0,  W  >  0,  if 

W.j  =  0  for  all  j  f  i  then  V.^  >_  0.  If,  in  addition  (A,  C)  is  an 

w 

observable  pair,  then  y.  >0. 

If  the  number  of  active  inputs  is  not  perturbed  it  is  sufficient  to 

w 

know  the  in  situ  input  cost  .  However,  additional  information  is 
required  in  order  to  determine  the  amount  by  which  the  performance  metric 
is  perturbed  after  an  input  is  deleted.  This  amount  is  defined  by 


(2.2) 


<  "  V  -  VRi 


where  t/Ri  is  the  value  of  1/  after  the  ith  input  is  removed.  Clearly,  Piw 
is  the  information  available  to  the  analyst  prior  to  deletion  of  the  i^ 


mi 


mm 


wws 


f 
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input  and  Al^w  is  the  information  available  after  such  deletion.  If 
l^w  is  to  be  used  a  priori  as  a  prediction  of  Al/^w,  it  is  of  interest  then 
to  know  how  accurately  V  *  predicts  the  actual  value  of  Al/^w.  Fur¬ 
thermore,  it  is  of  interest  to  know  when  this  prediction  is  exact  =  Al/^w). 

Theorem  4 

If  u(t)  =  0  in  (1.1),  (1.2)  then 

(2.3)  Al/iW  *  2l/.W  -  b.T  Sb.Wi.  . 

w 

If  W  is  a  diagonal  matrix  then  has  the  following  properties: 

(2.4)  l/iw  =  Al/.w  ,  i  =  1,  ....  m 

•  w  A  — 

(2.5)  l/(m)  =  J  I/.  ,  R  =  set  of  m  retained  inputs 

ieR  1 

where  l/(m)  is  the  value  of  V  with  only  the  reduced  set  of  m  inputs  acting . 

The  circumstances  (2.4),  (2.5)  are  very  valuable,  since  they  allow 
the  performance  of  the  reduced  input  system  l/(m)  to  be  evaluated  on  the 
basis  of  information  computed  prior  to  deletion  of  the  inputs  (l/jW).  Hence, 
when  (2.4)  holds,  then  (2.5)  follows.  Eq.  (2.5)  is  referred  to  as  the 
cost-superposition  property  of  inputs  costs,  whereas  (2.1)  is  referred  to 
as  the  cost-decomposition  property.  The  cost-decomposition  property  (2.1) 
always  holds,  whereas  (2.5)  holds  only  under  certain  restricted  conditions, 
such  as  the  diagonal  W  condition  of  Theorem  4. 

Finally,  it  should  be  noted  that  the  input  costs  V.w  art  invariant 
under  coordinate  transformation. 

Theorem  5 

If  u(t)  =  0  in  (1.1),  (1.2),  the  input  costs  defined  by  (1.4a)  are 
invariant  under  state  transformation  X  =  Ts ,  |T|  0. 


'■O  VI  .  ' 


a 


Since  the  input  costs  V*  provide  one  with  information  concerning 
the  individual  contributions  of  each  input  to  the  performance  metric,  the 
next  logical  use  of  this  information  is  to  describe  an  algorithm  which 
provides  an  answer  to  problem  (i)  in  the  Introduction.  This  shall  be 
referred  to  as  the  Open-Loop  Input  Selection  Problem. 

There  are  two  different  statements  of  the  Open-Loop  Input 
Selection  Problem.  We  shall  call  them  problem  (ICA-1)  and  (ICA-2): 

(ICA-1)  Let  m  and  in  <  m  be  specified  integers.  From  an  admissible  set 
of  m  inputs  find  a  reduced  set  of  m  inputs  such  that  the  input 
error  index  I, 

(2.6)  I  *  l^| 

is  minimized. 

(ICA-2)  Let  m  and  I  be  specified  numbers.  From  an  admissible  set  of 
inputs  find  the  smallest  number ,  m,  of  inputs  such 
that 

Z  Ji  i 

The  following  algorithm  provides  a  systematic  means  to  solve  problems 
(ICA-1)  or  (ICA-2). 

The  ICA  Algorithm 

STEP  1:  Specify  system  data  (A,  B,  W)  and  output  objectives  (C,  Q) 

for  (1.1),  (1.2)  with  u  =  0.  Specify  either  m  or  I  (to  solve 
either  problem  (ICA-1)  or  (ICA-2). 

STEP  2:  Compute  the  input  cost  perturbation,  from  (2.3),  for  1=1,  ....  m. 
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(2.7)  417, w  *  2[BTSBW]„  -  [BTSB«0](1  ,  Wp  -  diag  {W„ . 

(2.8)  0  -  SA  +  ATS  +  CTQC 

Of  course, if  W  is  diagonal,  then  W  =  WQ  and  according  to  Theorem  4  the 
calculation  (2.7)  can  be  replaced  by 

(2.9)  Al/J*  =  l/.w  =  [BTSBW] . . 

Rearrange  the  inputs  w.  such  that  >  at >...,>  at /  w  . 

i  I  —  c  —  —  m 

STEP  3:  If  I  is  specified,  go  to  STEP  4.  If  m  is  Specified  and  W  is 

diagonal,  then  retain  those  m  inputs  having  the  m  largest  values 

of  al/,W  and  stop.  If  W  is  not  diagonal,  delete  the  input  having 

the  smallest  value  of  aV.w.  For  this  reduced  B  =  [b, ,  ....  b  ,] 

i  I  m- 1 

return  to  STEP  2.  (Note  that  2.8  does  not  have  to  be  recomputed.) 
Repeat  this  cycle  m-m  times.  END. 

STEP  4:  If  7  is  specified,  delete  the  input  wm  with  the  smallest  al/^w. 

With  this  reduced  input  matrix  B  =  [b-j . bm-l^’  return  t0 

STEP  2  unless  the  following  condition  is  satisfied:  (Note  that 
(2.8)  does  not  have  to  be  recomputed.) 

(2.10)  I  =  >  I  ; 


where  V  equals  the  value  of  performance  metric  with  all  inputs 
present  and  m  refers  to  the  number  of  inputs  of  the  current 
iteration.  If  2.10  is  satisfied,  the  required  set  of  reduced 
inputs  is  the  set  of  inputs  from  the  previous  iteration  (i.e. 
a  set  of  m  +  1  inputs).  END.  (Note:  if  W  is  diagonal,  the 

STEP  4  iteration  cycle  is  unnecessary,  and  l/-l/(m)  =  I  aU.  ; 

w  ieJ 

J  =  set  of  lowest  's  calculated  from  the  1st  iteration. 


All  of  the  questions  of  Section  2.0  can  be  applied  to  outputs  instead 


of  inputs.  To  make  the  presentation  of  these  similar  results  concise, 
the  following  duality  can  be  used.  It  is  straightforward  to  show  that 
substitution  of  the  parameters  of  TABLE  1  converts  all  of  the  ICA  results 
of  Section  2.0  to  results  which  hold  for  Output  Cost  Analysis  (OCA). 

With  the  help  of  this  duality  the  following  results  are  listed  as 
corollaries  to  the  theorems  corresponding  to  their  ICA  application  of 
Section  2.  The  proof  of  these  corollaries  is  the  exact  dual  (Table  1)  of 
the  proofs  for  Theorems  1-5. 

Corollary  to  Theorem  2: 

y 

The  open-loop  output  coats  V.j  ,  i=l . k  defined  by  (1.4b)  ccnd 

calculated  by 

(3.1)  V.y  =  [CXCTQ].. 
where  X  satisfies 

(3.2)  0  =  XAT  +  AX  +  BWBT  , 

satisfy  the  cost-decomposition  property 

(3.3)  IT-  l  I//  . 

1=1  1 

Corollary  to  Theorem  3: 

For  the  stable  open-loop  system  (1.1),  (1.2),  (u(t)  =0),  Q  >  0, 
if  Qij  =  0  for  all  j^i  then  l/.y  >  0,  If,  in  addition ,  (A,  B)  is  a 
controllable  pair ,  then  V?  >  0. 

Corollary  to  Theorem  4: 

If  u(t)  =  0  in  (1.1),  (1.2)  then 


El  4 


(3.4)  =  2l/^y  -  c.T  X  ;  where  c1  is  the  ith  column  of  CT 

y 

If  Q  is  a  diagonal  matrix  then  tE  has  the  following  properties: 

(3.5)  l/.y  =  LV.y 

(3.6)  V(k)  =  I  V .y  ,  R  =  set  of  R  retained  outputs 

ieR  1 

where  V(k)  is  the  value  of  V  with  only  the  reduced  set  of  outputs. 

Corollary  to  Theorem  5: 

If  u(t)  =  0  in  (1.1),  (1.2),  the  output  costs  V.y  defined  by  (1.4b) 
are  invariant  under  state  transformation  X  =  Ts,  |T|  f  0. 

The  Open-Loop  Output  Selection  Problem 

Since  the  output  costs  provide  one  with  information  concerning 
the  individual  contributions  of  each  output  to  the  performance  metric, 
the  next  logical  use  of  this  information  is  to  describe  an  algorithm 
which  provides  an  answer  to  questions  (OCA-1)  and  (OCA-2)  below. 

(OCA-1)  Let  k  and  R  <  k  be  specified  integers.  From  an  admissible 
set  of  k  outputs  find  a  reduced  set  of  R  outputs  such 
that 

0.7)  0  i  |^| 

is  minimized. 

(OCA-2)  Let  k  and  0  be  specified  numbers.  From  an  admissible  set 
of  k  outputs  find  the  smallest  number ,  R,  of  outputs  such 
that 

e  <  e 


The  following  algorithm  provides  a  systematic  means  to  solve  problems 
(OCA-1)  and  (OCA-2). 

The  OCA  Algorithm 

STEP  1:  Specify  system  data  (A,B,W)  and  output  objectives  (C,Q)  for 

(1.1),  (1.2)  with  u  =  0.  Specify  R  (to  solve  problem  (OCA-1)) 
or  specify  e  (to  solve  problem  (OCA-2)). 

STEP  2:  Compute  the  output  cost  perturbation,  from  (3.4),  for  i=l . k 

(3.8)  Al/.y  =  2[CXCTQ]..  -  [CXCTQD3  ,  QD  =  diag  {Qn . Qkk} 

(3.8a)  0  =  XAT  +  AX  +  BWBT 

Of  course,  if  Q  is  diagonal,  then  Q  =  QD  and  according  to  the 
corollary  to  theorem  4  the  calculation  (3.8)  can  be 
replaced  by 

(3.9)  A  V.*  =  =  [CXCTQ].i 

Rearrange  the  outputs  y.  such  that 
Al/^y  _>  Al^  >_•••_>  A^ky 


STEP  3:  If  0  is  specified  go  to  STEP  4.  If  k  is  specified,  and  Q  is 
diagonal,  then  retain  those  R  outputs  having  the  R  largest 
values  of  Al/jy  and  END.  If  Q  is  not  diagonal,  delete  the 
output  having  the  smallest  value  of  A^y.  For  this  reduced 
CT  =  [c.|,  ...,  c  ,]  return  to  STEP  2.  (note  thct  3.8a  does 
not  have  to  be  recomputed).  Repeat  this  cycle  k-R  times. 


STEP  4:  If  0  is  specified,  delete  the  output  y^  with  the  smallest 
Al^y.  With  this  reduced  output  matrix  CT  =  [c-,,  . ...  cm 
return  to  STEP  2  unless  the  following  condition  is  satisfied, 
(note  that  (3.8a)  does  not  have  to  be  recomputed). 

(3.10)  0  £  |^j^|  >  0  ; 

where  R  refers  to  the  number  of  outputs  in  the  current  iteration 
If  3.10  is  satisfied,  the  required  reduced  set  of  outputs  is  the 
set  of  outputs  from  the  previous  iteration  (i.e.  a  set  of  lc+1 
outputs)  END.  (Note:  if  Q  is  diagonal,  the  STEP  4  iteration 

cycle  is  unnecessary,  and  V-l/(E)  =  \  Al/.y;  J  =  set  of  lowest 

w  UJ 

Al/.  's  calculated  from  the  1st  iteration. 

4.0  Closed-Loop  ICA/OCA 

Sections  2  and  3  assumed  u(t)  =  0  in  (1.1),  (1.2).  However, 
it  was  the  closed-loop  situation  (1.13)  which  motivated  the  definitions 
(1.4)  of  input  and  output  costs.  To  treat  the  closed-loop  system  (1.13) 
the  following  will  be  assumed: 

(A,  C)  observable 
(4.0)  (A,  M)  observable 

(A,  B)  controlable 

Under  these  conditions  the  matrix  A  in  (1.13)  is  guaranteed  stable  [4]. 
Now  the  concepts  of  Sections  2  and  3  will  be  applied  to  (1.13).  The 
practical  value  of  such  analysis  is  the  determination  of  sensor  and 
actuator  devices  which  are  most  critical  to  the  performance  metric 
(1.2).  Thus,  reasons  (ii)  and  (iii)  in  the  Introduction  are  the 
paramount  goals  of  Section  4.0. 

The  performance  metric  V  associated  with  the  closed-loop  system 
(1.13)  is  given  by  (1.2),  and  can  be  rewritten  in  the  form 


(4.1) 


V  =  lim  E\\tj\\l 


J 


'  1  7 


where  y  is  defined  in  (1.13b). 
to  (1.13)  yields 

(4.2)  V*  =  [BTSBW] . .  = 


«■[:  3 

Thus,  input  cost  analysis  (ICA)  applied 


i=l,  ...»  m 
j=l,  ....  I,  i  =nH- j  I 


(4.3) 

(4.4) 


0  =  SA  +  ATS  +  CT£C 

m+£  m  £ 

V  =  l  V*  •  l  +  l 

i=l  1  i=l  1  i=l  1 


and  output  cost  analysis  (OCA)  applied  to  (1.13)  yields 

(V*  1=1 . k 

(4.5)  V*  =  [CXCT<2],  •  =  <  u 

IVi »  j=l,  ...,  m,  i=k+j 


(4.6) 

(4.7) 


0  =  XAT  +  AX  +  BWBT 

k+m  ,  k  ,  m 

v-  i  i  vy+  i  v» 


i=l 


i=l 


w  th 

where  V.  may  be  interpreted  as  the  effect  of  the  i  plant  noise  on 

the  closed-loop  performance,  i/.v  is  the  effect  of  the  i^  measurement 

V  th 

noise  on  the  closed-loop  performance,  y..  is  the  effect  of  the  i 
output  yi  on  the  closed-loop  performance,  and  U.u  is  the  effect  of  the 
i*^1  control  input  u^  on  the  closed-loop  performance.  The  computation 
of  l^,  l^u,  l/.W,  l^V  follow  from  (4.2)  and  (4.5),  but  the  2nx2n  matrix 
equations  (4.3)  and  (4.6)  actually  represent  an  excessive  burden.  The 
following  result  shows  that  the  special  structure  of  A  in  {1.13a)  allows 
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the  solutions  of  (4.3)  and  (4.6)  to  be  expressed  in  terms  of  the  Riccati 
matrices  K  and  P  already  obtained  in  (1.13a). 


Theorem  6 


The  solutions  S  and  X  of  (4.3)  and  (4.6)  respectively  may  he  written 


(4.8) 


"K+L  -L~ 

_  -L  L_ 


where  K  satisfies  0  1  KA  +  A^K  -  KBR'^B^K  +  C^QC,  and  L  satisfies 


(4.9) 


0  =  L(A-FM)  +  (A-FM)TL  +  GTRG 


"P+N  N 


(4.10) 


where  P  satisfies  0  =  PAT  +  AP  -  PMTV_1MP  +  BWBT,  and  where  N  satisfies 


(4.11)  0  =  N(A+BG)T  +  (A+BG)N  +  FVF* 


Theorem  6  and  the  special  structure  of  the  matrices  C,  B  and  W 
in  (1.13),  (4.1),  allow  these  expressions  as  simplifications  of  (4.2) 
and  (4.5): 


(4.12)  l/.w  =  [BT£BW]if  =  [BT(K+L)BW]..,  i  =  1 . m 

(4.13)  =  [BTXBW] j .  =  [FTLFV]ii,  j  =  m+i ,  i  =  1,  ...,  I 


(4.14)  V*  =  [CXCT0]..  =  [C(P+N)CTQ]..  ,  i  =  1,  ....  k 

(4.15)  |/iu  =  [CXCT0]jj  *  [GNGTR]i^  ,  j  =  k+i,  i  =  1 . 


Theorems  7-9  follow  in  the  same  manner  as  in  Section  2.  Their  proofs 
are  contained  in  appendix  A. 


Theorem  7 


For  the  closed  loop  system  (1.13)  under  the  conditions  (4.0),  if 

W.  .  =  0  for  all  j  t  i  then  V.w  >  0.  If  V.  .  =  0  for  all  j  t  i  then 
'  J  *  1  J 

0.  In  addition ,  if  (A-FM,G)  is  an  observable  pair  then  I /.v  >  0. 

Theorem  8 

For  the  closed-loop  system  (1.13)  under  the  conditions  (4.0),  if 

Qi  j  =  0  for  all  j  t  i  then  V ?  >0.  If  =  0  for  all  j  f  i  then 

>  0.  In  addition,  if  (A+BG,F)  is  a  controllable  pair,  then  l/..U  >  0. 

Theorem  9 

The  closed-loop  input  and  output  costs,  V? ,  ^u,  J/^W,  l/^V  are 
invariant  under  state  transformation  X  c  Ts,  |T|  f  0. 

Equations  (4.2)-(4.7)  and  (4. 12)- (4. 15)  can  be  used  to  set  up  the 
duality  of  TABLE  2  applicable  to  closed- loop  systems. 

5.0  Selection  of  Noisy  Actuators  in  Closed-Loop  LQG  Problems 

A  simplifying  assumption  made  in  (1.1)  is  that  the  only  source  of 
disturbances  w(t)  is  the  actuator  noises.  Thus,  the  disturbance  distri¬ 
bution  matrix  and  the  control  distribution  matrix  are  the  same  (B/=D) . 
Questions  of  the  type  (ICA-1)  and  (ICA-2)  from  Section  2  are  now  posed 
for  the  closed-loop  system.  The  questions  are  complicated  by  the 
fact  that  for  the  closed-loop  system  (u(t)  $  0)  there  are 
two  costs,  l/^w  and  l/^  associated  with  inputs  to  (1.1a).  Since  V*  is 
the  "bad"  effect  (from  noisy  disturbances)  and  l/..u  is  the  "good"  effect 
(from  optimal  control  action),  it  is  reasonable  to  choose  the  difference 
between  the  good  and  bad  effects 


% 


| M 


(5.1) 


i 


=  u 
i  i 

as  the  "value"  of  the  ith  actuator.  A  negative  value  of  l/^act  indicates 
that  the  ith  actuator  is  contributing  more  noise  than  control  action  and 
should  either  be  deleted  or  its  signal  to  noise  ratio  should  be  improved. 
If  one  defines  =  V.u/V^  as  the  "effective  signal  to  noise  ratio"  of 
actuator  i,  then  (5.1)  becomes 

(5.2)  =  (nrl)  vf 

If  ( A, C)  is  an  observable  pair,  and  (A+BG.F)  is  a  controllable  pair,  and 
the  matrices  (W.R)  have  no  negative  elements  then  theorems  7  and  8 
guarantee  that  >  0  (since  and  l/^w  are  both  >  0).  Under  these  con- 
dit.ons  improvement  in  performance  is  expected  by  deleting  the  actuators 
with  effective  signal  to  noise  ratios  less  than  l(n^  <  1)  as  long  as 
controllability  of  (A,B)  is  maintained.  The  two  closed-loop  versions  of 

actuator  selection  problems  (ASP-1  and  ASP-2)  are  now  defined. 

Problem  ASP-1:  Given  the  optimal  system  (1.13)  using  m  admissible  but 

noisy  actuators 3  find  the  set  of  in  actuators  which  lead  to  the  smallest 
value  of  the  closed  loop  system  performance  metric  1/  as  defined  by  (1.2) 
without  losing  controllability.  Label  this  minimum  value  as  l/(m) .  The 
The  computations  suggested  to  approximate  the  solution  of  problem 
(ASP-1)  are  as  follows: 

Algorithm  ASP-1:  Using  (5.1),  (4.12),  (4.15),  solve  for  l^..ac^,  i  =  1, 

dc  t  qc  t 

...m,  and  rearrange  so  that  I/.  _>  . . .  .  Delete  the  actuator  with 

the  smallest  value  of  V.  if  that  value  is  negative  or  zero  and  if 

deletion  of  that  actuator  does  not  reduce  the  rank  [B,  AB,  . An  ^B]. 

Solve  the  optimal  LQG  problem  for  the  reduced  number  of  actuators  (i.e. 
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B  =  [b-j  ,  -j],  W  =  [w^  , .  . . Wm _ -j 3 )  •  Repeat  the  above  cycle  until  no 

actuator  satisfies  the  deletion  criteria.  The  number  of  actuators  left 
is  the  set  of  m  actuators  to  be  used. 

Note:  At  present,  no  guarantee  exists  that  algorithm  ASP-1  provides  an 
optimal  solution  for  Problem  ASP-1;  however,  the  results  of  Section  8 
do  lend  some  empirical  support,  and  in  addition  they  suggest  a  possibility 
for  greatly  reducing  the  computational  burden  of  the  algorithm. 

Problem  ASP-2:  Specify  the  integer  m  and  the  number  I  >_  1.0.  Given 
the  optimal  system  (1.13)  using  m  admissible  but  noisy 
actuators ,  find  the  smallest  m  set  of  actuators  such 
that 

(5.3)  •  _)  <.  I.  (and  (A,B)  a  controllable  pair) 

V(m)  C 

where  V(m)  is  the  smallest  value  of  V  obtained  in 
(ASP-1)  and  l/(m)  is  the  value  of  1/  obtained  with 
m  <  m  actuators. 


Although  no  proof  of  optimality  is  currently  available,  the  following 
algorithm  is  suggested  for  the  solution  of  ASP-2. 


.m. 


Algorithm  ASP-2:  Solve  algorithm  ASP-1  and  use  this  set  of  m  actuators 

in  the  solution  of  the  LQG  system  (1.13).  Compute  ,  i  =  1 

and  rearrange  so  that  y.ac^  _>  .  .  ,j>  Delete  the  actuator  with  the 

act1  ™ 

smallest  value  of  V.  if  deletion  of  that  actuator  does  not  effect  the 

i  n- 1 , 


rank  of  [ B,  ....  A  B].  Solve  the  optimal  LQG  problem  for  the  reduced 
number  of  actuators.  Repeat  the  above  cycle  until  either  ^  <  7C  or 
until  all  remaining  actuators  are  necessary  for  controllability .  The 
previous  set  of  m  actuators  is  then  the  number  desired. 
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6.0  Selection  of  Noisy  Sensors  in  Closed-Loop  LQG  Problems 

Before  discussing  noisy  sensor  selection,  it  is  useful  to  consider 
the  following  theorem: 

Theorem  10 

Adding  noisy  sensors  to  the  LQG  system  described  by  (1.13)  cannot 
increase  the  performance  metric  V  (i.e.  4.1),  of  the  system. 

Heuristically  speaking,  Theorem  10  is  plausable  since  the  sensor 
measurements,  z,  are  being  passed  through  a  Kalman  filter  and  the  purpose 
of  the  Kalman  filter  is  to  "de-emphasize"  or  "throw-out"  the  measurements 
which  have  more  noise  than  estimation  information,  [4],  (Note:  A  result 
such  as  Theorem  10  does  not  exist  for  actuators.  In  fact  it  will  be  shown 
later  that  adding  actuators  can^  degrade  performance  (Likewise  deleting 
actuators  can  help).)  As  a  result  of  this  property  of  the  Kalman  filter, 
it  is  not  surprising  that  any  sensor  noise  source  (v^)  that  is  making  a 
large  contribution  to  the  closed  loop  performance  metric  (i.e.  large  l^v) 
eminates  from  a  sensor  which  is  making  an  even  larger  contribution  in 
estimation  information!  Based  upon  these  points,  the  following  definition 

i.U 

for  the  iu  sensor  value  is  offered. 

(6.1)  l/,5'"  i  |/(v 

Large  values  of  V.v  Indicate  sensors  which  are  providing  a  "large" 
amount  of  estimation  information  and  are  therefore  critical  to  the 
performance  of  the  closed  loop  system.  Sensors  with  smaller  values  of 
l/^v  are  providing  less  information  and  are  therefore  candidates  for  deletion. 

In  light  of  this  discussion  the  following  closed  loop  sensor  selection 
problems  and  algorithms  are  defined. 

Problem  SSP-1  :  Let  t  and  l  <  l  be  specified  integers.  From  an  admissible 
set  of  l  sensors  find  a  reduced  set  of  L  sensors  such  that 


(6.2) 


*  IMlil 

<P  \j 

is  minimized.  Where  V[Z)  is  the  system  performance  metric 
with  only  Z  sensors  operating. 


The  computations  suggested  to  approximate  the  solution  of  problem  SSP-1 
are  as  follows: 


Algorithm  SSP-1:  Solve  for  V.sen,  1*1,  ....t  from  (6.1),  (4.13)  and  rearrange 


sen  sen 

so  that  U.  _>  . . .  Vp  .  Delete  the  sensor  with  the  smallest  value  of 

•  ^  j 

l/^sen  if  deletion  of  that  sensor  does  not  reduce  the  rank  of  [M^,  . . .  M^] 


Solve  the  optimal  LQG  problem  for  the  reduced  number  of  sensors  (i.e. 

MT  =  [Mi ,  - i J ,  V  =  diag  [V1 ,  -  V^])  .  Repeat  the  above  cycle 

l-Z  times  or  until  all  remaining  measurements  are  necessary  for  observ¬ 
ability  of  (A,  M). 


At  present,  no  guarantee  exists  that  algorithm  SSP-1  is  optimal. 
However,  it  can  be  shown  (See  Appendix  B)  that  V?  (i.e.  l/^611)  is  closely 
related  to  the  switching  functions  of  the  extended-Chen-Seinfeld  method  for 
optimal  selection  of  sensors,  [1].  The  switching  functions  are  used  to 
indicate  the  sensors  which  satisfy  the  necessary  conditions  for  optimality 
[1].  (i.e.  minimization  of  (6.2)).  The  above  fact,  together  with  the 

limited  results  presented  in  section  8  indicate  that  algorithm  SSP-1  may 
in  fact  be  optimal.  Research  is  continuing  in  this  area. 

A  second  version  of  the  closed-loop  sensor  selection  problem  is 
presented  below. 

Problem  SSP-2:  Let  t  and  +  be  specified  numbers.  From  an  admissible 


set  of  t  sensors ,  find  the  smallest  number,  Z ,  of  sensors 
such  that 

<t>  f  <P 

and  observability  of  (A,  M)  is  maintained. 


The  computations  suggested  to  approximate  the  solutions  to  SSP-2  are 


as  follows: 

sen* 

Algorithm  SSP-2:  Solve  for  ,  i  =  1,  ...  and  rearrange  so  that 

con  S6I1 

l\.  1/^  .  Delete  the  sensor  with  the  smallest  value  of 

^1-Sen  if  deletion  of  that  sensor  does  not  effect  observability  of  (A,  M) . 
Solve  the  optimal  LQG  problem  for  the  reduced  number  of  sensors  (i.e. 

MT  =  [Mr  ...  V  =  diag  [V] ,  ...  V^]).  Repeat  the  above  cycle 

until  <J>  is  no  longer  less  than  or  equal  to  $  or  (A,  M)  is  no  longer 
observable.  At  this  point  the  last  set  of  sensors  is  the  desired  set  of 
i  sensor 8. 

Further  research  is  required  to  establish  the  relative  degree  of 
optimality  of  Algorithm  SSP-2.  . 

7.0  The  Sensor/Actuator  Selection  (SAS)  Algorithm 

The  actuator  selection  problems  and  algorithms  presented  in  section 
5  assumed  that  no  change  takes  place  in  the  number,  type,  and  location  of 
the  sensors.  The  sensor  selection  problems  and  algorithms  presented  in 
section  6  assumed  that  no  change  takes  place  in  the  number,  type,  and 
location  of  the  actuators.  In  practical  design  problems  one  needs  an 
algorithm  for  combined  actuator  and  sensor  selection.  The  fact  that 
actuator  and  sensor  selection  are  coupled  is  well  evidenced  by  equations 
(4.9),  (4.11)  and  (4.12-4.15).  As  a  result  of  this  coupling,  it  is 
necessary  to  have  an  algorithm  which  simultaneously  solves  problems  of 
type  (ASP-1,  ASP-2)  and  (SSP-1,  SSP-2).  As  our  first  attempt  for 
generating  such  an  algorithm  it  is  suggested  that  the  appropriate  actuator 
algorithm  and  sensor  algorithm  of  Sections  5  and  6  be  solved  simultaneously 
More  specifically,  if  it  is  desired  to  solve  ASP-1  and  SSP-1,  algorithms 
ASP-1  and  SSP-1  should  be  implimented  simultaneously.  The  optimality  of 
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such  an  algorithm  or  the  possibility  of  simplified  variations  of  such  an 


algorithm  is  currently  under  study. 

8.0  Hoop-Antenna  Example 

The  ASP-1  Algorithm  of  Section  6  was  applied  to  a  26-state  model  of  a 
Hoop-Antenna  Satellite  which  has  12  actuators,  39  sensors,  and  27  outputs. 
Data  was  also  collected  to  provide  some  verification  of  the  validity  of 
algorithm  SSP-1 .  For  the  sake  of  brevity,  only  the  general  details  of  the 
model  are  presented  in  (8.1).  A  complete  model  description  is  provided 
in  Appendix  C  along  with  a  schematic  describing  sensor  and  actuator 
locations. 


x  =  Ax  +  B(u  +  w);  xcR2^;  ueR^2;  weR^2 
y  =  Cx;  yeR27 

1/  =  lim  E{  ||y  ||  2  +  || u  ||  2};  Q  >  0,  R  >  0  (Q,  and  R  diagonal) 
t-*°°  y  * 

More  specifically, 


1/  = 


27  ?  q.  12  ?  r. 

lim  J  E{y.  }  +  £  E{u.  }  I 

t-~>  i=l  1  yf  i=l  1  ufm 
lm  lm  [ 


(8.1) 


where. 


•*» 


’i 


7T 


lm 


•  th 


y.  =  Max  allowable  value  of  i  output 
lm 


q.  =  dimensionless  output  weight 


r. 


.th 


Rii  s~r  * 


u.  =  Max  allowable  control  effort  for  i  actuator 
lm 


uim 


' r .  =  dimensionless  control  weight 


,39 


z  =  Mx  +  v;  ZcR' 

E{w(t)}  =  E{v(t) }  =  0 

E{w(t)wT(x)  =  W <5 ( t- x ) ;  W  >  0 

E{v(t)vT(t)}  =  1/6 ( t-T> ;  1/  >  0 

E{v(t)wT(T)}  =  0,E{x(t)wT(r)>  =  0,  E{x(t)uT(x)}  =  0,  t  >  t 


-  iguCtuLm.  MM  jCmJCm 
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A  closed  loop  analysis  of  (8.1)  was  performed  and  Q  and  R  were  chosen  such 
that  system  specifications  were  met  (i.e.  y?  >  EC v . 2 ) .  i=l  27  u2  > 

i=l . 12).  Algorithm  ASP-1  was  then  applied.  Specifically,  the 

actuators  with  negative  values  of  l/.  (and  not  necessary  for  controllability) 
were  deleted  one  at  a  time.  Figure  la  shows  a  graph  of  the  total  system  cost 
versus  the  number  of  closed  loop  actuators  as  the  algorithm  progressed.  Thu 
nurubers  above  the  bar  graphs  indicate  deleted  actuators.  Figure  lb  shows  the 
total  output  cost  versus  the  number  of  closed  loop  actuators. 


Note  that  the  algorithm  recotmiends  a  6-actuator  closed  loop  system  and 
that  this  system  is  almost  25%  better  in  terms  of  total  cost  and  performance 
cost  than  the  original  12-actuator  system.  It  is  important  to  note  also 
that  the  control  effort  of  each  actuator  in  the  reduced  actuator  system 
was  still  within  its  specifications. 

As  a  partial  check  on  the  optimality  of  the  ASP-1  algorithm,  the 
system  was  iteratively  operated  with  each  one  of  the  12  actuators  deleted 
while  the  11  other  actuators  remained.  The  results  are  shown  in 
Figure  2. 

From  Figure  2  it  is  apparent  that  deleting  actuator  10  would  be  the 


optimal  decision  if  only  1  actuator  were  to  be  deleted.  This  result  agrees 
with  the  decision  of  Algorithm  ASP-1  for  one  actuator  (See  Fig.  1).  To 
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check  the  complete  optimality  of  Algorithm  ASP-1  empirically,  the  analysis 
of  Fig.  2  would  have  to  be  repeated  for  any  2  actuators  deleted,  any  3 
actuators  deleted,  etc.  until  the  entire  set  of  possible  combinations  were 
exercised.  Clearly,  this  brute  force  approach  is,  computationally,  an 
undesirable  alternative  to  Algorithm  ASP-1. 

The  in  situ  contribution  or  "value"  of  the  i^*1  actuator  is  given  by 

acf 

.  However,  the  actual  perturbation  of  the  cost  after  the  deletion  of 
the  i*^  actuator  will  be  labeled  Al^ac*. 

Mathematically  speaking, 

(8.2)  Al/..act  =  l/(m-l )  -  l/(m) 

where  U(m-l)  is  the  cost  of  the  system  operating  without  the  i  1  actuator 
and  l/(m)  is  the  cost  of  the  system  operating  with  all  actuators. 

•  art  ar f 

Since  we  intend  to  use  V.  as  an  approximation  of  aV.  in  the 
actuator  selection  process,  we  wish  to  know  how  good  the  approximation 
is. 


Figure  3  indicates  that  V.act  tracks  the  actual  AV.act  with  some  bias. 
However,  a  more  important  feature  of  actuator  costs  would  be 

(8.4)  (l/.act  >  1/  act)  =>  ( Al/.act  >  M/.act) 

®  J  1  J 

Verification  of  (8.4)  would  be  sufficient  for  verifying  the  optimality  of 
Algorithm  ASP-1.  By  comparing  the  data  of  Figure  2  to  the  ordering  of 
V.  with  all  actuators  present,  condition  (8.4)  was  verified  for  the 
example  problem. 

Another  interesting  fact  noticed  in  the  data  for  this  example  was 
that  if  the  deletion  criteria  of  Algorithm  ASP-1  was  applied  only  to  the 
original  ranking  of  l/.act,  (i.e.  a  new  ranking  was  not  calculated  after 
each  deletion)  the  algorithm  still  retained  the  same  six  actuators  as 
before.  If  this  result  is  shown  to  be  a  property  of  V^act:,  Algorithm 
ASP-1  would  become  non-iterative  and  this  would  greatly  reduce  its 
computational  burden.  In  summary,  the  results  presented  above  indicate 
the  usefulness  and  potential  optimality  of  Algorithm  ASP-1  for  solving 
the  noisy  actuator  selection  problem. 


'  *  .  f  ^  / 


*vv 


y 

Figure  4  indicates  that  V.  is  a  good  prediction  of  the  increase  in  the 
performance  metric  that  results  when  the  ith  sensor  is  deleted.  Comparing 
Figure  4  to  Table  3  the  following  observation  can  be  made 

(8.5)  l/.v  >  V*  =>  >  &V. 

If  (8.5)  can  be  shown  to  be  a  generic  property  of  the  SSP-1  algorithm,  it 
would  be  sufficient  to  prove  the  optimality  of  the  Algorithm. 

9.0  Conclusion 

This  paper  has  defined  Input  Cost  Analysis  (ICA)  and  Output  Cost 
Analysis  (OCA)  and  shown  the  basic  properties  of  each.  Algorithms  using 
ICA  and  OCA  in  an  open  loop  setting  were  then  presented  and  they  provided 
an  optimal  solution  to  the  open-loop  input  and  output  selection  problems 
posed  in  Sections  2  and  3.  In  Section  4,  closed  loop  versions  of  ICA  and 
OCA  were  developed  and  their  properties  discussed.  Selection  of  noisy 
actuators  was  discussed  in  Section  5.0  and  algorithms  using  closed-loop 
ICA  and  OCA  were  suggested  for  the  solution  of  ASP-1  and  ASP-2.  In 
Section  6  it  was  shown  that  adding  sensors  cannot  degrade  LQG  performance. 
Noisy  sensor  selection  problems  were  defined  and  algorithms  using  closed- 
loop  OCA  and  ICA  were  posed  for  the  solution  of  SSP-1  and  SSP-2.  In 
Section  7,  a  combined  sensor  and  actuator  selection  algorithm  was  suggested. 
The  noisy  actuator  selection  algorithm,  ASP-1,  of  Section  5,  was  applied 
to  a  model  of  a  hoop  antenna  satellite  in  Section  8.  The  results  indicated 
that  the  system  performed  better  with  a  fewer  number  of  noisy  actuators. 
Research  is  continuing  in  these  areas.  In  addition,  data  was  also  presented 
in  Section  8  which  supported  the  optimality  of  algorithms  SSP-1  and  SSP-2. 
Both  empirical  and  analytical  research  on  the  optimality  of  the  algorithms 
posed  in  this  paper  are  continuing  with  particular  focus  on  the  combined 
algorithms  suggested  in  Section  7. 
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Proof  of  Theorem  2: 


For  u  =  0,  it  can  be  shown  [4]  that  the  1/  defined  in  (1.1)  is  equal  to 
value  given  by  (1.10),  (1.11).  Using  the  fact  that  trAB  =  trBA,  (1.10) 
occomes 

(Al)  V  *  tr[SBWBT]  =  tr[BTSBW] 

But,  from  (1 .12) 

(A2)  I  l/.W  =  I  [BTSBW]..  =  tr[BTSBW] 

i=l  1  i=l  11 

Therefore,  from  (Al)  and  (A2) 

m  w 

V  ■  £  V  , 

i*l  1  * 


Proof  of  Theorem  3: 


The  expression  (1.12)  can  be  written  as  follows,  if  W  is  diagonal. 


(A3)  l/.w  =  bJsb.W..  (b.  =  ith  col.  of  B) 


The  stability  of  A  guarantees  that  S  is  the  unique  and  at  least  positive 
semi -definite  solution  of  the  Lyapunov  equation  (1.11).  Therefore,  since 
B  is  of  maximal  rank  (i.e.  ||b..||  >  0  for  all  i  =  1,  ...  m)  and  since 

W  >  0  implies  W^.  >  0,  then  (A3)  cannot  be  negative  and  the  first  part  of 
the  theorem  is  proved. 


The  observability  of  (A,  C)  guarantees  that  S  is  positive  definite. 
Hence  (A3)  is  strictly  positive  and  the  theorem  is  proved.  # 


Proof  of  Theorem  4: 


Without  loss  of  generality  but  with  considerably  more  ease  in  re¬ 
presentation,  assume  the  last  input,  w^,  is  to  be  deleted.  The  following 
partitions  are  defined: 

(A4)  B  =  [Br  bm3;  Br  cRnx("»-’).  bmERnxl 


(A5)  W  * 


WR  wRm 
wRm 


.  L,  ntm-DxM).  w  rRm'1x1-  W  rR1 
,  WrER  ,  WRmeK  ,  W^eK 


using  (A4)  and  (A5), (1  - 1 2)  can  be  written  as 

(A6)  V  =  tr[SBWBT]  «  tr[SBDWDBDT  +  Sb  wDmTBDT  +  SBDwD  b  T  +  Sb  w  b  T] 
1  R  R  R  mRmR  RRmm  m  mm  m  J 

after  deletion  of  the  mth  input, 

(A7)  ‘'(m-1)  -  tr[SBRWRBRT] 


and 


(A8)  V  w  =  [BTSBW]  =  b  TS Bdwd_  +  b  TSb  w 
m  mm  m  R  Rm  m  m  mm 

note  that  (A8)  can  be  written  as  follows: 

(A9)  =  tr[SBRWRmbmT  ♦  Sbm%mbmT] 

using  (A6)  and  (A7), 


(A10)  AVmw  «  V- b* (m- 1  )=*r[Sb  w_  TBRT]+tr[SBRwR  b  T]+tr[Sb  w  b  T] 


T„  T- 


m  Rm  R 


RRmm 


m  mm  m 


substituting  (A9)  into  (A10)  yields: 


(All)  =  trtSb^V]  +  v* 


Now,  using  tr(AB)  =  tr(ATBT)  and  the  fact  that  S  is  symmetric,  (All)  becomes 
(A12)  Al/mW  -  trESB^bJ]  +  vf 

Adding  and  subtracting  tr CSbmv^mibmT3  to  (A12)  results  in: 


(A13)  Abmw=  trESB^b^Sb^b/]  ♦  vf  -  tr[Sbmnm,bmT] 

using  (A9)  and  rewriting  tr[3y^J>nT]  =  bjsb^  gives 

(A14)  Al/  w  =  21/  w  -  b  TSb  w 

m  m  m  m  inn 

letting  m  =  i  in  (A14)  completes  the  proof  of  (2.3) 

For  Proof  of  (2.4)  look  again  at  (All) 

(A15)  -  trCSb^V]  ♦  vf 

Where  =  0  if  W  is  diagonal.  Hence  (A15)  leads  immediately  to  (2.4). 


for  Proof  of  (2.5), 

If  W  is  diagonal,  the  expression  for  V ^  becomes: 


(A16)  l/fW  =  I  b/sbjW..  -  b/sb^, 

J  * 


since  S  does  not  depend  on  B,  (Al 6 )  implies  that  the  input  cost  for  the  ith 
input  is  dependent  strictly  upon  the  i^  column  of  the  B  matrix  and  the 
ith  diagonal  entry  of  W.  Therefore,  no  matter  how  many  inputs  are  deleted, 
the  cost  of  the  i^  input  in  the  reduced  system  will  mt  change.  There¬ 


fore,  invoking  Theorem  2  for  the  reduced  input  system 
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l/(m)  =  z  l/.w 
ieR 


where  R  is  the  set  of  m  inputs  now  acting 

on  the  system. 

# 


Proof  of  Theorem  5: 

For  the  transformed  system, 

x  =  T_1ATx  +  T‘]Bw 
(AT  7)  y  =  CTx 

V  =  Tim  E{  j|y  H  0) ,  Q  >  0 


the  input  cost  is 


(A18)  l/iw  =  [BTT_T/a‘1BW].i  (from  1.12) 


where  from  (1.11) 

(A19)  KT~^AT  +  TTATT"TK  +  TTCTQCT  =  0 

post  multiply  by  T_1and  premultiply  by  T’T  in  (A19) 

(A20)  rT<T_1A  +  AWT1  +  CTQC  =  0 

Because  A  is  stable,  (A20)  and  (1.11)  both  have  a  unique  solution  which  in 

(1.11)  was  defined  to  be  S.  Therefore, 

(A21 )  s  =  rVr1 

or 

(A22)  K  =  TTST 

substituting  (A22)  into  (A18)  gives: 


(A23)  V?  =  [bYVstYbW]..  =  [BTSBW].. 

W 

which  is  identical  to  in  (1.12),  and  the  proposition  is  proved.  # 
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Outline  of  Proof  of  Theorem  6: 

The  most  straight  forward  proof  of  the  validity  of  (4.8  and  4.10) 
is  to  substitute  them  into  (4.3  and  4.6)  respectively  and  then  multiply 
out  the  partitioned  forms  of  the  matrices  which  are  given  in  (1.13). 

The  details  are  omitted. 

Proof  of  Theorem  7: 

For  diagonal  W  and  V,  equations  (4.12)  and  (4.13)  can  be  rewritten 
as  follows: 

(A24)  V*  =  biT(K+L)b.W.1,  i=l,  ...  m 

(A25)  V  =  fjTLfjVjj  (f.  =  Pm.V^'1),  m.  =  ith  col.  of  MT 

and  P  is  defined  by  (1.13). 

The  conditions  (4.0)  when  applied  to  the  closed  loop  system  (1.13) 
guarantee  that  K  is  positive  definite  and  L  defined  by  (4.9)  is  at  least 
positive  semi -definite.  [4]  Therefore,  since  W  >  0  and  V  >  0  (A24)  can 
never  be  zero  or  negative  and  (A25)  can  never  be  negative.  The  matrix  L 
defined  in  (4.9)  will  be  positive  definite  if  the  pair  (A-FM,  G)  is 
observable.  Hence,  with  this  condition,  (A25)  can  never  be  zero  or 
negative  and  the  theorems  proof  is  complete.  It  should  be  noted  that  the 
full  rank  of  B  and  M  implies  || b1 1|  f  0  and  ||f.  ||  +  0. 
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Proof  of  Theorem  8: 

The  proof  is  the  exact  dual  (Table  2)  of  the  proof  of  Theorem  7. 
Proof  of  Theorem  9: 

Since  the  closed  loop  system  of  1.13b  is  in  the  same  form  as  1.1 
the  proof  of  Theorem  5  and  its  corollary  applies  directly  to  Theorem  9. 

Proof  of  Theorem  10: 

Prom  [4]  it  is  known  that  the  closed  loop  performance  metric  V  (4.1) 
of  (1.13)  can  be  expressed  as  follows: 

(A26)  1/  *  tr[KBWBT  +  PGTRG] 

Now  let  V  equal  the  system  performance  metric  for  the  system  operating 
with  one  additional  sensor.  Therefore t 

(A27)  V+  =  tr[KBKBT  +  P+GTRG] 

where 

(A28)  P+AT  +  AP+  -  P+M+TV+"1M+P+  +  BWBT  =  0 

(A29)  M+  =  I  T  I  ;  meRn  T 

j_  m  J  (i.e.  added  column  of  M  matrix) 

(A30)  V+  =  P  °  1  ;  v+eRlxl 

L  V+J  (i.e.  variance  of  new  sensor  noise) 

Subtracting  (A27)  from  (A26)  gives  the  following: 


(A31 )  AU  i  V-V+  =  tr[(P-P+)GTRG] 
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Equation  (A31)  can  be  rewritten  as  follows: 

(A32)  At/  =  tr[(P-P+)GTRG]  =  tr[  /rT(P-P+)  /rTT] 

Therefore,  if  (P-P  )  is  at  least  positive  semi -definite  the  theorem  is 
proved. 

Recall  that  the  matrix  P  used  here  and  in  (1.13)  is  defined  by  the 
following: 

(A33)  PAT  +  AP  -  PMTV_1MP  +  BWBT  =  0 
Now,  subtracting  (A28)  from  (A33)  gives: 

(A34)  (P-P+)AT  +  A(P-P+)  -  PmV^P  +  P+M+TV+‘\p+  =  0 
adding  ±  PM+TV+_1M+P+  to  (A34)  yields: 

(A35)  (P-P+)(AT-M+TV+_1M+P+)  +  A(P-P+)  -  PMTV_1MP  +  PM+TV+~]M+P+  =  0 

adding  ±  P+M+TV+_1M+P+,  ±  P+M+TV+_1M+P  to  (A35)  results  in: 

(A36)  (P-P+)(AT-M+TV+"1M+P+)  +  (A-P+M+TV+'1M+)(P-P+)  -  PMTV_1MP  + 

pm+tv+_1m+p+  +  p+m+tv+_1m+(p-p+)  =  0 

adding  ±  Pmv+-1mTP  to  (A36),  making  use  of  (A29)  and  (A30)  and  the 
definition  F+  =  P+M+TV+-1  gives: 

(A37)  (P-P+)(A-F+M+)T  +  (A-F+M+)(P-P+)  -  PM+TV+_1M+(P-P+)  + 

P+M+TV+-]M+(P-P+)  +  Pmv+-1mTP  =  0 

collecting  terms  gives: 

( A38 )  (P-P+)  (A-F+M+)T  +  (A-F+M+)(P-P+)  -  (P-P+)M+TV+"\(P-P+)  + 

Pmv+"^mTP  =  0 
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Equation  (A38)  is  a  standard  matrix  Ricatti  eq.  It  is  well  known,  [4], 
that  the  soln.  to  (A38)  (i.e.  P-P+)  is  at  least  positive  semi-definite 
if  the  matrix  (A-F+M+)  is  stable.  The  matrix  (A-F+M+)  will  be  stable 
if  the  matrix  pair  (A,  B)  is  controllable  and  the  pairs  (A,  C)  and  (A,  M+) 
are  observable.  From  the  conditions  (4.0)  (A,  B)  is  controllable  and 
(A,  C)  and  (A,  M)  are  observable.  Therefore  (A,  M+)  must  be  observable 
since  adding  a  Row  to  M  (i.e.  generating  M+  cannot  effect  the  observability 
of  (A,  M).  # 


Y ■-  vVvV/ilVDO.V.V. 
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In  [1]  it  was  shown  that  the  switching  function  for  the  extended 
Chen-Seinfeld  method  of  optimal  selection  of  sensors  in  systems  of  type 
(1.13)  was 

(61)  »c*  ■  tr(Pm.V-1mjTPA2) 

where  m..  is  a  column  of  MT  and  a2  is  defined  by: 

(B2)  A2(A-PmV1M)  +  (AT-MTV"1MP)a2  +  KBR-1BTK  =  0 
where  P  is  defined  by: 

l 

(B3)  PAT  +  AP  -  ?  q.Pm.V'Vp  +  BWBT  =  0 

i-1  1  1  1 

1  if  ith  sensor  is  to  be  used 
0  if  ith  sensor  is  to  be  deleted 

The  expression  for  (//  is  given  by  (4.3),  using  F  =  PMTV-1  gives, 

(B4)  l/.v  =  [V'WPM1]^.  =  Vii"1m.TPLPm. 

whe  re 

(B5)  L(A-PMTV_1M)  +  (A-PMV’M)^  +  KBR'Vk  =  0 
(B6)  PAT  +  AP  -  PMTV-1MP  +  BWBT  =  0 


using  the  tr  AB  =  tr  BA  (Bl)  becomes  : 
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I 

comparing  eqs.  (B2),  (B3)  and  (B7)  to  eqs.  (B4),  (B5)  and  (B6)  it  is 

apparent  that,  with  the  exception  of  the  trace  operation  on  V'1  in  (B7), 

l/.v  is  equivalent  to  calculating  o J1  for  the  system  with  all  admissible 
1  ci 

measurements  present,  (i.e.  all  q^  =  1  in  (B3)). 


v  -  -  -  *  »  -  v.  » 


>  -  1  •  ■ 
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y  Appendix  C:  Hoop  Antenna  Model 

•_  i 

Figure  C-l  is  a  schematic  of  the  hoop  antenna  model  (8.1). 


Table  C-l  describes  the  actuator  types  and  locations. 


Actuator  (TorquerJ  # 


Location 


Direction  of  Torque 
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Table  C-2  describes  the  sensor  types  and  locations. 


Sensor  # 


Inertial  Angle 


Relative  Linear  Disp.  Between 

II 


Location  Direction 


6  and  2 


9  and  2 


10  and  2 


Inertial  Angle 


Relative  Linear  Displ.  between  101  and  10 


Inertial  Angular  Rate 


107  and  10 


113  and  10 


119  and  10 


Table  C-2:  Sensor  description 


Table  C-3  describes  the  output  types  and  locations  (the  Y  vector 


in  (8.1). ) 


Output  # 


Location  Direction 


Inertial  Angle 


Relative  Angle  Between 


10  and  2 


Inertial  Angle 

Relative  Linear  Displ.  Between 


6  and  2 


9  and  2 


10  and  2 


101  and  10 


107  and  10 


113  and  10 


119  and  10 


Table  C-3:  Output  description 
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The  contents  of  the  matrices  (A,  B,  C,  Q,  M,  R,  W,  V)  specified 
in  (8.1)  are  described  below. 

12 

10 
10 
3 
3 


"°  '10  „  ' 

(20 

0 

]  (Cl  a)  A  = 

n 

| 

2 

■“  _2?aj  o  i3 

/ 

B  = 

?  6 

BE 

0 

! 

1 
O  J 

o 

_ 1 

5 

BR 

w  =  diag  [.40579,  7.2090,  7.2362,  13.277,  44.834,  132.14, 
142.66,  445.01,  448.69,  775.86]  (radians2/sec2) 

(Clb)  I^q  =  10x10  Identity  Matrix,  1^  =  3x3  Identity  Matrix 

2cu--d1ag  [.0127,  .053699,  .0538,  .07286,  .26283,  .45981, 
.47777  ,  .84381,  .8473,  1.1142]  (radians/sec) 


(Clc)  M  = 


JO 

ME 


3 

MR 


MER 

10 


MRR 
3  " 


10  10  3  3 

C  =  27  [CE  0  CR  0] 


The  contents  of  the  above  defined  submatricies  are  listed  on  the  next 
several  pages. 
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System  Performance  Requirements: 


vim  =  -02°- 1  =  1-6 

Yim  =  .0005  meters,  i  =7,8 . 27 

u .  =  1  n-m 

im 


Weights: 


=  1,  i  =  1,  2,  4,  5 
=  .01 ,  i  =3,  6 
=  .1 ,  i  =7,  8,  ...  27 
=  10"5,  i  =  1,  ...  12 


Q  =  diag  [82.07,  82.07,  .8207,  82.07,  82.07,  .8207, 
400,000,  400,000]  105 

21  elements 


R  =  10"5I12  ,  W  =  10"8I12 


7.6154  x  10"9I. 
2.5  x  10'7Ig 
2.5  x  10”7Ilo 


=  4.7597  x  10  3I, 
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A^Ct  AND  COMPARISON  OF  ALGORITHM  ASP-1  RESULTS 
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AVi  AND  V.  COMPARISON  FOR  ONE  SENSOR  DELETED 


FIG  C-l :  HOOP  COLUMN  ANTENNA  SCHEMATIC 
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